TRANSPORTERS AND ION CHANNELS

TECHNICAL FIELD

BACKGROUND OF THE INVENTION
various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging techniques
and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. Natl.
Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta,
and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate (Prasad,
P.D. et al. (1998) J. Biol. Chem. 273:7501-7506).

One of the largest families of transporters is the major facilitator superfamily (MFS), also 
called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers that 
transport small solutes in response to ion gradients. Members of the MFS are found in all classes of 
living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates,
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure
comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1-34).
The largest family of MFS transporters is the sugar transporter family, which includes the seven
glucose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose
and other hexose sugars. These glucose transport proteins have unique tissue distributions and
physiological functions. GLUT1 provides many cell types with their basal glucose requirements and
transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake
or efflux from the liver; GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for
insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake into skeletal muscle. Defects in
glucose transporters are involved in a recently identified neurological syndrome causing infantile
seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel syndrome, and
non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 219:713-725; Longo,
N. and L.J. Elsas (1998) Adv. Pediatr. 45:293-313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are
predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop between
TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the protons that are
produced stoichiometrically with lactate during glycolysis. The best characterized
H+-monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and a
wide range of other aliphatic monocarboxylates. Other cells possess H+-linked monocarboxylate
transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor
cells have transporters that differ in their Km values for certain substrates, including stereoselectivity
for L- over D-lactate, and in their sensitivity to inhibitors. There are Na+-monocarboxylate

WO 03/083085 

PCT/US03/09797 

cotransporters on the luminal surface of intestinal and kidney epithelia, which allow the uptake of 
lactate, pyruvate, and ketone bodies in these tissues. In addition, there are specific and selective 
transporters for organic cations and organic anions in organs including the kidney, intestine and liver. 
Organic anion transporters are selective for hydrophobic, charged molecules with electron-attracting 
side groups. Organic cation transporters, such as the ammonium transporter, mediate the secretion of
a variety of drugs and endogenous metabolites, and contribute to the maintenance of intercellular pH 
(Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol. 264:C761-C782; Price, N.T. et al. (1998) 
Biochem. J. 329:321-328; and Martinelle, K. and I. Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids,
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC
transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to
supply the energy required for transport, and two membrane-spanning domains (MSD), each
containing six putative transmembrane segments. These four modules may be encoded by a single
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes.
When encoded by separate genes, each gene product contains a single NBD and MSD. These
"half-molecules" form homo- and heterodimers, such as Tap1 and Tap2, the endoplasmic reticulum-based
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP),
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia
(sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 
ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used 
in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium,
nickel, and chromium are important as cofactors for a number of enzymes. For example, copper is
involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by acting as a
cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), and lysyl
oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transporters in
the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and other target organs,
where specific transporters move the ions into cells and cellular organelles as needed. Imbalances in
metal ion metabolism have been associated with a number of disease states (Danks, D.M. (1986) J. 
Med. Genet. 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity,
low affinity process. However, under normal physiological conditions a significant fraction of fatty
acid transport appears to occur via a high affinity, low capacity protein-mediated transport process.
Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane
segments, is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as
muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose
conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et
al. (1998) J. Biol. Chem. 273:27420-27429).

The lipocalin superfamily constitutes a phylogenetically conserved group of more than forty
proteins that function as extracellular ligand-binding proteins which bind and transport small
hydrophobic molecules. Members of this family function as carriers of retinoids, odorants,
chromophores, pheromones, allergens, and sterols, and in a variety of processes including nutrient
transport, cell growth regulation, immune response, and prostaglandin synthesis. A subset of these
proteins may be multifunctional, serving as either a biosynthetic enzyme or as a specific enzyme
inhibitor (Tanaka, T. et al. (1997) J. Biol. Chem. 272:15789-15795; and van't Hof, W. et al. (1997) J.
Biol. Chem. 272:1837-1841).

Members of the lipocalin family display unusually low levels of overall sequence conservation. 
Pairwise sequence identity often falls below 20%. Sequence similarity between family members is 
limited to conserved cysteines which form disulfide bonds and three motifs which form a juxtaposed 
cluster that functions as a target cell recognition site. The lipocalins share an eight stranded,
anti-parallel beta-sheet which folds back on itself to form a continuously hydrogen-bonded beta-barrel.
The pocket formed by the barrel functions as an internal ligand binding site. Seven loops (L1 to L7)
form short beta-hairpins, except loop L1 which is a large omega loop that forms a lid to partially close
the internal ligand-binding site (Flower (1996) Biochem. J. 318:1-14). 

Lipocalins are important transport molecules. Each lipocalin associates with a particular
ligand and delivers that ligand to appropriate target sites within the organism. Retinol-binding protein
(RBP), one of the best characterized lipocalins, transports retinol from stores within the liver to target
tissues. Apolipoprotein D (apo D), a component of high density lipoproteins (HDLs) and low density
lipoproteins (LDLs), functions in the targeted collection and delivery of cholesterol throughout the
body. Lipocalins are also involved in cell regulatory processes. Apo D, which is identical to
gross-cystic-disease-fluid protein (GCDFP)-24, is a progesterone/pregnenolone-binding protein expressed at
high levels in breast cyst fluid. Secretion of apo D in certain human breast cancer cell lines is 
accompanied by reduced cell proliferation and progression of cells to a more differentiated phenotype. 
Similarly, apo D and another lipocalin, α1-acid glycoprotein (AGP), are involved in nerve cell
regeneration. AGP is also involved in anti-inflammatory and immunosuppressive activities. AGP is
one of the positive acute-phase proteins (APP); circulating levels of AGP increase in response to
stress and inflammatory stimulation. AGP accumulates at sites of inflammation where it inhibits
platelet and neutrophil activation and inhibits phagocytosis. The immunomodulatory properties of AGP
are due to glycosylation. AGP is 40% carbohydrate, making it unusually acidic and soluble. The
glycosylation pattern of AGP changes during acute-phase response, and deglycosylated AGP has no 
immunosuppressive activity (Flower (1994) FEBS Lett. 354:7-11; Flower (1996) supra). 

The lipocalin superfamily also includes several animal allergens, including the mouse major 
urinary protein (mMUP), the rat α-2-microglobulin (rA2U), the bovine β-lactoglobulin (βlg), the
cockroach allergen (Bla g 4), bovine dander allergen (Bos d2), and the major horse allergen, designated
Equus caballus allergen 1 (Equ c1). Equ c1 is a powerful allergen responsible for about 80% of
anti-horse IgE antibody response in patients who are chronically exposed to horse allergens. It appears
that lipocalins may contain a common structure that is able to induce the IgE response (Gregoire, C. et
al. (1996) J. Biol. Chem. 271:32951-32959).

Lipocalins are used as diagnostic and prognostic markers in a variety of disease states. The
plasma level of AGP is monitored during pregnancy and in diagnosis and prognosis of conditions
including cancer chemotherapy, renal dysfunction, myocardial infarction, arthritis, and multiple
sclerosis. RBP is used clinically as a marker of tubular reabsorption in the kidney, and apo D is a
marker in gross cystic breast disease (Flower (1996) supra). Additionally, the use of lipocalin animal
allergens may help in the diagnosis of allergic reactions to horses (Gregoire supra), pigs, cockroaches, 
mice and rats. 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP,
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the pyruvate
carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the
tricarboxylate carrier which transports citrate and malate; and the Graves' disease carrier protein, a
protein recognized by IgG in patients with active Graves' disease, an autoimmune disorder resulting in
hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100
amino acid domain, each of which contains two transmembrane regions (Stryer, L. (1995)
Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSITE PDOC00189
Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in Man (OMIM)
*275000 Graves Disease).

This class of transporters also includes the mitochondrial uncoupling proteins, which create
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation from
ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling proteins
have been implicated as modulators of thermoregulation and metabolic rate, and have been proposed
as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. (1999) J.
Int. Med. 245:637-642).
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form
ion-selective pores within the membrane. There are two basic types of ion channels, ion transporters and
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse
conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration
gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion
transporters, including Na+-K+ ATPase, Ca2+-ATPase, and H+-ATPase, are activated by a
phosphorylation event. P-class ion transporters are responsible for maintaining resting potential
distributions such that cytosolic concentrations of Na+ and Ca2+ are low and cytosolic concentration of
K+ is high. The vacuolar (V) class of ion transporters includes H+ pumps on intracellular organelles,
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH
within the lumen of these organelles that is required for function. The coupling factor (F) class
consists of H+ pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate
ATP from ADP and inorganic phosphate (Pi). 

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and
several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) Curr.
Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the V1
domain, a peripheral complex responsible for ATP hydrolysis; and the V0 domain, an integral complex
responsible for proton translocation across the membrane. The F-ATPases are structurally and
evolutionarily related to the V-ATPases. The F-ATPase F0 domain contains 12 copies of the c
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a single
buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V0 domain
contains three types of homologous c subunits with four or five transmembrane domains and the
essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that
may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem. 
274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of
the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport
(symport) so that the movement of Na+ down an electrochemical gradient drives transport of the other
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca2+ out of the cell
with transport of Na+ into the cell (antiport).
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The ability
to control ion flux through various gating mechanisms allows ion channels to mediate such diverse
signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction,
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to
the manner of regulating the gating function. Mechanically-gated channels open their pores in
response to mechanical stress; voltage-gated channels (e.g., Na+, K+, Ca2+, and Cl- channels) open
their pores in response to changes in membrane potential; and ligand-gated channels (e.g.,
acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter.
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and
closing) are sometimes modulated by association with auxiliary channel proteins and/or
post-translational modifications, such as phosphorylation.

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or
stress on the cell membrane and conduct both Ca2+ and Na+ (Suzuki, M. et al. (1999) J. Biol. Chem.
274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and
carboxy termini. In the Na+ and Ca2+ subfamilies, this domain is repeated four times, while in the K+
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits.
The P region contains information specifying the ion selectivity for the channel. In the case of K+
channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. Natl. Acad.
Sci. USA 94:11651-11656). 

Voltage-gated Na+ and K+ channels are necessary for the function of electrically excitable
cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na+
and K+ ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na+
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more
voltage-gated Na+ channels, which propagates the depolarization down the length of the cell.
Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts
only about 1 millisecond, at which time the channel spontaneously converts into an inactive state that
cannot be opened irrespective of the membrane potential. Inactivation is mediated by the channel's
N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a closed state
requires a return to resting potential.
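
The closed, open, and inactive states described above can be sketched as a small state machine. This is a minimal illustration only: the names ChannelState and next_state are hypothetical, and the threshold and resting potentials are assumed round numbers. Only the roughly 1 millisecond open time comes from the text.

```python
from enum import Enum


class ChannelState(Enum):
    CLOSED = "closed"            # resting; can open on depolarization
    OPEN = "open"                # conducting ions
    INACTIVATED = "inactivated"  # pore plugged; cannot be opened


# Illustrative values (assumptions), except the ~1 ms open time from the text.
THRESHOLD_MV = -55.0
RESTING_MV = -70.0
OPEN_DURATION_MS = 1.0


def next_state(state, membrane_mv, ms_in_state):
    """Return the channel state after one update step."""
    if state is ChannelState.CLOSED:
        # Depolarization beyond the threshold opens the channel.
        if membrane_mv > THRESHOLD_MV:
            return ChannelState.OPEN
        return state
    if state is ChannelState.OPEN:
        # After about 1 ms the channel inactivates spontaneously.
        if ms_in_state >= OPEN_DURATION_MS:
            return ChannelState.INACTIVATED
        return state
    # INACTIVATED: stays inactive whatever the potential, until the
    # membrane has returned to the resting potential.
    if membrane_mv <= RESTING_MV:
        return ChannelState.CLOSED
    return state
```

In this sketch an inactivated channel ignores further depolarization, which mirrors the statement that the inactive state cannot be opened irrespective of the membrane potential.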

Voltage-gated Na+ channels are heterotrimeric complexes composed of a 260 kDa
pore-forming α subunit that associates with two smaller auxiliary subunits, β1 and β2. The β2 subunit is an
integral membrane glycoprotein that contains an extracellular Ig domain, and its association with α and
β1 subunits correlates with increased functional expression of the channel, a change in its gating
properties, as well as an increase in whole cell capacitance due to an increase in membrane surface
area (Isom, L.L. et al. (1995) Cell 83:433-442). 

Non-voltage-gated Na+ channels include the members of the amiloride-sensitive Na+
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini located
within the cell. The NaC/DEG family includes the epithelial Na+ channel (ENaC) involved in Na+
reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney, and
exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's
syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized
H+-gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are expressed in the
brain and form heteromultimeric Na+-permeable channels. These channels require acid pH
fluctuations for activation. ASIC subunits show homology to the degenerins, a family of
mechanically-gated channels originally isolated from C. elegans. Mutations in the degenerins cause
neurodegeneration. ASIC subunits may also have a role in neuronal function, or in pain perception,
since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. Neurobiol. 
8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337-342). 

K+ channels are located in all cell types, and may be regulated by voltage, ATP concentration,
or second messengers such as Ca2+ and cAMP. In non-excitable tissue, K+ channels are involved in
protein synthesis, control of endocrine secretions, and the maintenance of osmotic equilibrium across
membranes. In neurons and other excitable cells, in addition to regulating action potentials and
repolarizing membranes, K+ channels are responsible for setting the resting membrane potential. The
cytosol contains non-diffusible anions and, to balance this net negative charge, the cell contains a
Na+-K+ pump and ion channels that provide the redistribution of Na+, K+, and Cl-. The pump actively
transports Na+ out of the cell and K+ into the cell in a 3:2 ratio. Ion channels in the plasma membrane
allow K+ and Cl- to flow by passive diffusion. Because of the high negative charge within the cytosol,
Cl- flows out of the cell. The flow of K+ is balanced by an electromotive force pulling K+ into the cell,
and a K+ concentration gradient pushing K+ out of the cell. Thus, the resting membrane potential is
primarily regulated by K+ flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489-492).
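
The balance between the electrical force and the concentration gradient for a single ion is conventionally expressed by the Nernst equation, which gives the potential at which the two cancel. The following is a minimal sketch, not part of the source: the function names and the ion concentrations are assumed, textbook-style illustrative values, while the 3:2 pump ratio is the one stated above.

```python
import math

R = 8.314462618   # gas constant, J/(mol*K)
F = 96485.33212   # Faraday constant, C/mol


def nernst_potential_mv(conc_out_mm, conc_in_mm, valence, temp_c=37.0):
    """Equilibrium potential (mV, inside relative to outside) for one ion."""
    temp_k = temp_c + 273.15
    return 1000.0 * (R * temp_k) / (valence * F) * math.log(conc_out_mm / conc_in_mm)


def net_charges_exported(cycles, na_out=3, k_in=2):
    """Net positive charges moved out of the cell by the Na+-K+ pump."""
    return cycles * (na_out - k_in)


# Assumed illustrative concentrations in mM (not taken from the source).
K_OUT_MM, K_IN_MM = 5.0, 140.0
e_k = nernst_potential_mv(K_OUT_MM, K_IN_MM, valence=1)
print(f"K+ equilibrium potential: {e_k:.1f} mV")
```

With these assumed concentrations the K+ equilibrium potential is strongly negative, which is consistent with K+ flow dominating the resting membrane potential.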

Potassium channel subunits of the Shaker-like superfamily all have the characteristic six
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form
functional K channels. These pore-forming subunits also associate with various cytoplasmic β
subunits that alter channel inactivation kinetics. The Shaker-like channel family includes the
voltage-gated K+ channels as well as the delayed rectifier type channels such as the human ether-a-go-go
related gene (HERG) associated with long QT, a cardiac dysrhythmia syndrome (Curran, M.E. (1998)
Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. Chem.
Biol. 3:448-458). 

A second superfamily of K+ channels is composed of the inward rectifying channels (Kir).
Kir channels have the property of preferentially conducting K+ currents in the inward direction. These
proteins consist of a single potassium selective pore domain and two transmembrane domains, which
correspond to the fifth and sixth transmembrane domains of voltage-gated K+ channels. Kir subunits
also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to Bartter 
syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac pacemaker 
activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et al. (1995) Curr. Opin.
Neurobiol. 5:268-277; Curran, supra). 

The recently recognized TWIK K+ channel family includes the mammalian TWIK-1, TREK-1
and TASK proteins. Members of this family possess an overall structure with four transmembrane
domains and two P domains. These proteins are probably involved in controlling the resting potential
in a large set of cell types (Duprat, F. et al. (1997) EMBO J. 16:5464-5471).

The voltage-gated Ca2+ channels have been classified into several subtypes based upon their
electrophysiological and pharmacological characteristics. L-type Ca2+ channels are predominantly
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type
channels are involved in the control of neurotransmitter release in the central and peripheral nervous
system. The L-type and N-type voltage-gated Ca2+ channels have been purified and, though their
functions differ dramatically, they have similar subunit compositions. The channels are composed of
three subunits. The α1 subunit forms the membrane pore and voltage sensor, while the α2δ and β
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the channel.
These subunits are encoded by at least six α1, one α2δ, and four β genes. A fourth subunit, γ, has
been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; McCleskey, 
E.W. (1994) Curr. Opin. Neurobiol. 4:304-312). 

The high-voltage-activated Ca2+ channels that have been characterized biochemically include
complexes of a pore-forming alpha1 subunit of approximately 190-250 kDa; a transmembrane
complex of alpha2 and delta subunits; an intracellular beta subunit; and in some cases a
transmembrane gamma subunit. A variety of alpha1 subunits, alpha2delta complexes, beta subunits,
and gamma subunits are known. The Cav1 family of alpha1 subunits conduct L-type Ca2+ currents,
which initiate muscle contraction, endocrine secretion, and gene transcription, and are regulated
primarily by second messenger-activated protein phosphorylation pathways. The Cav2 family of
alpha1 subunits conduct N-type, P/Q-type, and R-type Ca2+ currents, which initiate rapid synaptic
transmission and are regulated primarily by direct interaction with G proteins and SNARE proteins and
secondarily by protein phosphorylation. The Cav3 family of alpha1 subunits conduct T-type Ca2+
currents, which are activated and inactivated more rapidly and at more negative membrane potentials
than other Ca2+ current types. The distinct structures and patterns of regulation of these three
families of Ca2+ channels provide an array of Ca2+ entry pathways in response to changes in
membrane potential and a range of possibilities for regulation of Ca2+ entry by second messenger
pathways and interacting proteins (Catterall, W.A. (2000) Annu. Rev. Cell Dev. Biol. 16:521-555). 

The alpha-2 subunit of the voltage-gated Ca2+ channel may include one or more Cache
domains. An extracellular Cache domain may be fused to an intracellular catalytic domain, such as
the histidine kinase, PP2C phosphatase, GGDEF (a predicted diguanylate cyclase), HD-GYP (a
predicted phosphodiesterase) or adenylyl cyclase domain, or to a noncatalytic domain, like the
methyl-accepting, DNA-binding winged helix-turn-helix, GAF, PAS or HAMP (a domain found in
histidine kinases, adenylyl cyclases, methyl-binding proteins and phosphatases). Small molecules are bound
via the Cache domain and this signal is converted into diverse outputs depending on the intracellular 
domains (Anantharaman, V. and Aravind, L.(2000) Trends Biochem. Sci. 25:535-537). 

The transient receptor family (Trp) of calcium ion channels is thought to mediate
capacitative calcium entry (CCE). CCE is the Ca2+ influx into cells to resupply Ca2+ stores depleted
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and
growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to
voltage-gated Ca2+ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins may
form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) J. Biol.
Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, whose
expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo. The
human cDNA transcript corresponds to a 1533-amino acid protein having homology to members of the
Trp family. It has been proposed that the combined use of melastatin mRNA expression status and
tumor thickness might allow for the determination of subgroups of patients at both low and high risk
for developing metastatic disease (Duncan, L.M. et al. (2001) J. Clin. Oncol. 19:568-576).

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and
organelle pH. In secretory epithelial cells, Cl- enters the cell across a basolateral membrane through
an Na+, K+/Cl- cotransporter, accumulating in the cell above its electrochemical equilibrium
concentration. Secretion of Cl- from the apical surface, in response to hormonal stimulation, leads to
flow of Na+ and water into the secretory lumen. The cystic fibrosis transmembrane conductance
regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic
disorder in humans. CFTR is a member of the ABC transporter family, and is composed of two
domains each consisting of six transmembrane domains followed by a nucleotide-binding site. Loss of
CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus that coat
the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The resulting
blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating "chronic
obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266).

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits probably
function as homotetramers. CLC proteins are involved in regulation of cell volume, membrane
potential stabilization, signal transduction, and transepithelial transport. Mutations in CLC-1, expressed
predominantly in skeletal muscle, are responsible for autosomal recessive generalized myotonia and
autosomal dominant myotonia congenita, while mutations in the kidney channel CLC-5 lead to kidney
stones (Jentsch, T.J. (1996) Curr. Opin. Neurobiol. 6:303-310).

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds 
to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle 

10 cells. There are two types of neurotransmitter-gated channels. Sodium channels open in response to 
excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 
and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, 
such as y-aminobutyric acid (GAB A) and glycine, leading to hyperpolarization of the membrane and 

15 the subsequent generation of an action potential. Neurotransmitter-gated ion channels have four 
transmembrane domains and probably function as pentamers (Jentsch, supra). Amino acids in the 
second transmembrane domain appear to be important in determining channel permeation and 
selectivity (Sather, W.A. et al. (1994) Curr. Opin. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example,
calcium-activated K⁺ channels are gated by internal calcium ions. In nerve cells, an influx of calcium
during depolarization opens K⁺ channels to modulate the magnitude of the action potential (Ishi et al.,
supra). The large conductance (BK) channel has been purified from brain and its subunit composition
determined. The α subunit of the BK channel has seven rather than six transmembrane domains in
contrast to voltage-gated K⁺ channels. The extra transmembrane domain is located at the subunit
N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium bowl" region)
contains many negatively charged residues and is thought to be the region responsible for calcium
binding. The β subunit consists of two transmembrane domains connected by a glycosylated
extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra; Vergara, C. et al. (1998)
Curr. Opin. Neurobiol. 8:321-329).

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best
examples of these are the cAMP-gated Na⁺ channels involved in olfaction and the cGMP-gated
cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein
coupled receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also
represent a major pathway for Ca²⁺ entry into neurons, and play roles in neuronal development and
plasticity. CNG channels are tetramers containing at least two types of subunits, an α subunit which
can form functional homomeric channels, and a β subunit, which modulates the channel properties.
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and
sixth transmembrane domains, similar to voltage-gated K⁺ channels. A large C-terminal domain
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among
channel subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412).

The activity of other types of ion channel proteins may also be modulated by a variety of 
intracellular signaling proteins. Many channels have sites for phosphorylation by one or more protein 
kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of which
regulate ion channel activity in cells. Kir channels are activated by the binding of the Gβγ subunits of
heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell Biol. 11:503-508).
Other proteins are involved in the localization of ion channels to specific sites in the cell membrane.
Such proteins include the PDZ domain proteins known as MAGUKs (membrane-associated guanylate
kinases) which regulate the clustering of ion channels at neuronal synapses (Craven, S.E. and D.S.
Bredt (1998) Cell 93:495-498). 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across
membranes include, e.g., cystinuria, iminoglycinuria, Hartnup disease, and Fanconi disease (van't Hoff,
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226;
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480).

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of 
sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 
contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while mutations in
the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle sodium channel
cause malignant hyperthermia. Cardiac arrhythmia disorders such as the long QT syndromes and
idiopathic ventricular fibrillation are caused by mutations in potassium and sodium channels (Cooper,
E.C. and L.Y. Jan (1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four known human
idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and I.E. Scheffer (1999) Curr.
Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, hemiplegic migraine and
hereditary deafness can also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin.
Neurobiol. 9:274-280; Cooper, supra).

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, C.P. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98).
Various classes of ion channels also play an important role in the perception of pain, and thus are
potential targets for new analgesics. These include the vanilloid-gated ion channels, which are
activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as lidocaine
and mexiletine, which block voltage-gated Na⁺ channels, have been useful in the treatment of
neuropathic pain (Eglen, supra). 

Ion channels in the immune system have recently been suggested as targets for
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell
specific ion channels has been characterized that affect this signaling process. Channel blocking
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide
antagonist of the T-cell potassium channel Kv1.3 was found to suppress delayed-type hypersensitivity
and allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious
immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756).
Expression profiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of molecules
spatially distributed over, and stably associated with, the surface of a solid support. Microarrays of
polypeptides, polynucleotides, and/or antibodies have been developed and find use in a variety of
applications, such as gene sequencing, monitoring gene expression, gene mapping, bacterial
identification, drug discovery, and combinatorial chemistry.

One area in particular in which microarrays find use is in gene expression analysis. Array
technology can provide a simple way to explore the expression of a single polymorphic gene or the
expression profile of a large number of related or unrelated genes. When the expression of a single
gene is examined, arrays are employed to detect the expression of a specific gene or its variants.
When an expression profile is examined, arrays provide a platform for identifying genes that are tissue
specific, are affected by a substance being tested in a toxicology assay, are part of a signaling
cascade, carry out housekeeping functions, or are specifically related to a particular genetic
predisposition, condition, disease, or disorder.
Expression Profiling in Treatments for Cancer 

Tumor cells stimulate the formation of stroma that secretes various mediators, such as growth 
factors, cytokines, and proteases, which are critical for tumor growth. For instance, serum tumor
necrosis factor alpha (TNF-α) is increased in the circulation of patients with malignancy. Clinically,
treatment with TNF-α, also called cachectin, in combination with interferon-gamma (IFN-γ) may
provide a successful approach to overcome the cellular heterogeneity of advanced breast tumors.
TNF-α has been demonstrated to be antitumorigenic in MCF-7 cells by inducing apoptosis and
inhibiting proliferation. TNF-α is produced by neutrophils, activated lymphocytes, macrophages, NK
cells, LAK cells, astrocytes, endothelial cells, smooth muscle cells, and some transformed cells.
TNF-α occurs as a secreted, soluble form and as a membrane-anchored form, both of which are biologically
active. Two types of receptors for TNF-α have been described and virtually all cell types studied
show the presence of one or both of these receptor types. TNF-α and TNF-β are extremely
pleiotropic factors due to the ubiquity of their receptors, to their ability to activate multiple signal
transduction pathways and to their ability to induce or suppress the expression of a wide number of
genes. TNF-α and TNF-β play a critical role in mediation of the inflammatory response and in
mediation of resistance to infections and tumor growth. 

The cytokine interferon gamma (IFN-γ) induces growth arrest in normal human mammary
epithelial cells by establishing a block during mid-G1 phase. IFN-γ inhibits the kinase activities of
cdk2, cdk4 and cdk6 within 24 h of treatment. IFN-γ-mediated growth inhibition requires signal
transducers and activators of transcription (STAT)-1 activation and may require induction of the
cyclin-dependent kinase inhibitor p21. IFN-γ, possibly through the elevation of caspase-8 levels,
sensitizes human breast tumor cells to a death receptor-mediated, mitochondria-operated pathway of
apoptosis. IFN-γ, also known as type II interferon or immune interferon, is produced primarily by
T-lymphocytes and natural killer cells. IFN-γ exhibits antiproliferative, immunoregulatory and
proinflammatory activities and is thus important in host defense mechanisms. IFN-γ induces the
production of cytokines, and upregulates the expression of class I and II MHC antigens, Fc receptor,
and leukocyte adhesion molecules. It modulates macrophage effector functions, influences isotype
switching and potentiates the secretion of immunoglobulins by B cells. IFN-γ also augments TH1 cell
expansion and may be required for TH1 cell differentiation. The IFN-γ receptor has been cloned and
characterized, and is structurally related to the IL-10 receptor. It is present on almost all cell types
except mature erythrocytes.
Breast cancer

Breast cancer is the most frequently diagnosed type of cancer in American women and the 
second most frequent cause of cancer death. The lifetime risk of an American woman developing 
breast cancer is 1 in 8, and one-third of women diagnosed with breast cancer die of the disease. A
number of risk factors have been identified, including hormonal and genetic factors. One genetic
defect associated with breast cancer results in a loss of heterozygosity (LOH) at multiple loci such as
p53, Rb, BRCA1, and BRCA2. Another genetic defect is gene amplification involving genes such as
c-myc and c-erbB2 (Her2-neu gene). Steroid and growth factor pathways are also altered in breast
cancer, notably the estrogen, progesterone, and epidermal growth factor (EGF) pathways. Breast
cancer evolves through a multi-step process whereby premalignant mammary epithelial cells undergo
a relatively defined sequence of events leading to tumor formation. An early event in tumor
development is ductal hyperplasia. Cells undergoing rapid neoplastic growth gradually progress to
invasive carcinoma and become metastatic to the lung, bone, and potentially other organs. Variables
that may influence the process of tumor progression and malignant transformation include genetic
factors, environmental factors, growth factors, and hormones.
Lung cancer

Lung cancer is the leading cause of cancer death for men and the second leading cause of 
cancer death for women in the U.S. Lung cancers are divided into four histopathologically distinct 
groups. Three groups (squamous cell carcinoma, adenocarcinoma, and large cell carcinoma) are 
classified as non-small cell lung cancers (NSCLCs). The fourth group of cancers is referred to as
small cell lung cancer (SCLC). Deletions on chromosome 3 are common in this disease and are
thought to indicate the presence of a tumor suppressor gene in this region. Activating mutations in
K-ras are commonly found in lung cancer and are the basis of one of the mouse models for the disease.
Colon cancer 

While soft tissue sarcomas are relatively rare, more than 50% of new patients diagnosed with
the disease will die from it. The molecular pathways leading to the development of sarcomas are
relatively unknown, due to the rarity of the disease and variation in pathology. Colon cancer evolves
through a multi-step process whereby pre-malignant colonocytes undergo a relatively defined
sequence of events leading to tumor formation. Several factors participate in the process of tumor
progression and malignant transformation including genetic factors, mutations, and selection.

To understand the nature of gene alterations in colorectal cancer, a number of studies have 
focused on the inherited syndromes. Familial adenomatous polyposis (FAP) is caused by mutations in
the adenomatous polyposis coli gene (APC), resulting in truncated or inactive forms of the protein.
This tumor suppressor gene has been mapped to chromosome 5q. Hereditary nonpolyposis colorectal
cancer (HNPCC) is caused by mutations in mismatch repair genes. Although hereditary colon
cancer syndromes occur in a small percentage of the population and most colorectal cancers are
considered sporadic, knowledge from studies of the hereditary syndromes can be generally applied.
For instance, somatic mutations in APC occur in at least 80% of sporadic colon tumors. APC
mutations are thought to be the initiating event in the disease. Other mutations occur subsequently. 
Approximately 50% of colorectal cancers contain activating mutations in ras, while 85% contain 
inactivating mutations in p53. Changes in all of these genes lead to gene expression changes in colon 
cancer. 

Osteosarcoma

Osteosarcoma is the most common malignant bone tumor in children. Approximately 80% of 
patients present with non-metastatic disease. After the diagnosis is made by an initial biopsy, 
treatment involves the use of 3-4 courses of neoadjuvant chemotherapy before definitive surgery, 
followed by post-operative chemotherapy. With currently available treatment regimens, approximately
30-40% of patients with non-metastatic disease relapse after therapy. Currently, there is no
prognostic factor that can be used at the time of initial diagnosis to predict which patients will have a
high risk of relapse. The only significant prognostic factor predicting the outcome in a patient with
non-metastatic osteosarcoma is the histopathologic response of the primary tumor resected at the time
of definitive surgery. The degree of necrosis in the primary tumor is a reflection of the tumor
response to neoadjuvant chemotherapy. A higher degree of necrosis (good or favorable response) is
associated with a lower risk of relapse and a better outcome. Patients with a lower degree of
necrosis (poor or unfavorable response) have a much higher risk of relapse and poor outcome even
after complete resection of the primary tumor. Unfortunately, poor outcome cannot be altered despite
modification of post-operative chemotherapy to account for the resistance of the primary tumor to
neoadjuvant chemotherapy. Thus, there is an urgent need to identify prognostic factors that can be
used at the time of diagnosis to recognize the subtypes of osteosarcomas that have various risks of 
relapse, so that more appropriate chemotherapy can be used at the outset to improve the outcome. 
Ovarian cancer 

Ovarian cancer is the leading cause of death from a gynecologic cancer. The majority of 
ovarian cancers are derived from epithelial cells, and 70% of patients with epithelial ovarian cancers
present with late-stage disease. As a result, the long-term survival rate for this disease is very low.
Identification of early-stage markers for ovarian cancer would significantly increase the survival rate. 



WO 03/083085 PCT/US03/09797 

Genetic variations involved in ovarian cancer development include mutation of p53 and microsatellite 
instability. Gene expression patterns likely vary when normal ovary is compared to ovarian tumors. 
Immune response 

Human peripheral blood mononuclear cells (PBMCs) can be classified into discrete cellular
populations representing the major cellular components of the immune system. PBMCs contain about
52% lymphocytes (12% B lymphocytes, 40% T lymphocytes), 20% NK cells, monocytes, and 3%
various cells that include dendritic cells and progenitor cells. The proportions, as well as the biology, of
these cellular components tend to vary slightly between healthy individuals, depending on factors such
as age, gender, past medical history, and genetic background.

Tumor necrosis factor alpha (TNF-α), also called cachectin, is produced by neutrophils,
activated lymphocytes, macrophages, NK cells, LAK cells, astrocytes, endothelial cells, smooth
muscle cells, and some transformed cells. TNF-α occurs as a secreted, soluble form and a
membrane-anchored form, both of which are biologically active. Two types of receptors for TNF-α
have been described, and virtually all cell types studied show the presence of one or both of these
receptor types. TNF-α and TNF-β are extremely pleiotropic factors due to the ubiquity of their
receptors, their ability to activate multiple signal transduction pathways, and their ability to induce or
suppress the expression of a wide number of genes. TNF-α and TNF-β play a critical role in
mediation of the inflammatory response and in mediation of resistance to infections and tumor growth.

There is a need in the art for new compositions, including nucleic acids and proteins, for the
diagnosis, prevention, and treatment of transport, neurological, muscle, immunological and cell
proliferative disorders.

SUMMARY OF THE INVENTION 

Various embodiments of the invention provide purified polypeptides, transporters and ion
channels, referred to collectively as "TRICH" and individually as "TRICH-1," "TRICH-2," "TRICH-3,"
"TRICH-4," "TRICH-5," "TRICH-6," "TRICH-7," "TRICH-8," "TRICH-9," "TRICH-10,"
"TRICH-11," "TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17,"
"TRICH-18," "TRICH-19," "TRICH-20," "TRICH-21," "TRICH-22," "TRICH-23," "TRICH-24,"
"TRICH-25," "TRICH-26," "TRICH-27," "TRICH-28," "TRICH-29," "TRICH-30," "TRICH-31,"
"TRICH-32," "TRICH-33," "TRICH-34," "TRICH-35," "TRICH-36," "TRICH-37," "TRICH-38,"
"TRICH-39," "TRICH-40," "TRICH-41," "TRICH-42," "TRICH-43," "TRICH-44," "TRICH-45,"
"TRICH-46," "TRICH-47," "TRICH-48," "TRICH-49," "TRICH-50," "TRICH-51," "TRICH-52,"
"TRICH-53," "TRICH-54," "TRICH-55," "TRICH-56," "TRICH-57," "TRICH-58," and "TRICH-59," and
methods for using these proteins and their encoding polynucleotides for the detection, diagnosis, and
treatment of diseases and medical conditions. Embodiments also provide methods for utilizing the
purified transporters and ion channels and/or their encoding polynucleotides for facilitating the drug
discovery process, including determination of efficacy, dosage, toxicity, and pharmacology. Related
embodiments provide methods for utilizing the purified transporters and ion channels and/or their
encoding polynucleotides for investigating the pathogenesis of diseases and medical conditions.

An embodiment provides an isolated polypeptide selected from the group consisting of a) a
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO:1-59, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at
least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:1-59, c) a biologically active fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-59, and d) an immunogenic fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59. Another
embodiment provides an isolated polypeptide comprising an amino acid sequence of SEQ ID NO:1-59.
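
The "at least 90% identical" criterion recited in these embodiments can be illustrated with a minimal sketch. The sketch assumes that the two amino acid sequences have already been aligned by some external method and that gaps are written as "-"; it counts identical residues over the aligned columns. The function names, the choice of denominator, and the toy sequences are hypothetical and are used for illustration only; this section does not specify the alignment algorithm or the exact definition of percent identity.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumption: inputs are pre-aligned, equal-length strings with '-' for gaps.
    Columns that are gaps in both sequences are ignored; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True if percent identity is at least the threshold (90% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 20-residue toy sequences, not taken from SEQ ID NO:1-59.
    reference = "MKTAYIAKQRQISFVKSHFS"
    one_change = "MKTAYIAKQRQISFVKSHFT"     # 19 of 20 positions identical
    three_changes = "MKTAYLAKQRQASFVKSHFT"  # 17 of 20 positions identical
    print(percent_identity(reference, one_change))
    print(meets_identity_threshold(reference, three_changes))
```

Other conventions for the denominator exist, for example the length of the shorter sequence, and the value obtained for a given pair of sequences can differ depending on the convention and on the alignment parameters used.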

Still another embodiment provides an isolated polynucleotide encoding a polypeptide selected
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-59, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical or at least about 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-59, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-59. In another embodiment, the polynucleotide encodes a polypeptide
selected from the group consisting of SEQ ID NO:1-59. In an alternative embodiment, the
polynucleotide is selected from the group consisting of SEQ ID NO:60-118.

Still another embodiment provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-59, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical or at least about 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-59, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-59, and d) an immunogenic fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59.
Another embodiment provides a cell transformed with the recombinant polynucleotide. Yet another
embodiment provides a transgenic organism comprising the recombinant polynucleotide.

Another embodiment provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-59, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical or at least about 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-59, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-59, and d) an immunogenic fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59.
The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide,
wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence
operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so
expressed.

Yet another embodiment provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59, b) a polypeptide comprising a naturally
occurring amino acid sequence at least 90% identical or at least about 90% identical to an amino acid
sequence selected from the group consisting of SEQ ID NO:1-59, c) a biologically active fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59,
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-59.

Still yet another embodiment provides an isolated polynucleotide selected from the group
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:60-118, b) a polynucleotide comprising a naturally occurring polynucleotide
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected
from the group consisting of SEQ ID NO:60-118, c) a polynucleotide complementary to the
polynucleotide of a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA
equivalent of a)-d). In other embodiments, the polynucleotide can comprise at least about 20, 30, 40,
60, 80, or 100 contiguous nucleotides.

Yet another embodiment provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO:60-118, b) a polynucleotide
comprising a naturally occurring polynucleotide sequence at least 90% identical or at least about 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:60-118, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to
said target polynucleotide, under conditions whereby a hybridization complex is formed between said
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of
said hybridization complex. In a related embodiment, the method can include detecting the amount of
the hybridization complex. In still other embodiments, the probe can comprise at least about 20, 30,
40, 60, 80, or 100 contiguous nucleotides.
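
The sequence requirement placed on the probe, namely at least 20 contiguous nucleotides complementary to the target polynucleotide, can be sketched in code. The sketch below checks only perfect Watson-Crick complementarity of a contiguous DNA stretch; it does not model hybridization conditions such as temperature or salt concentration, which determine whether a complex actually forms. The function names and the toy target sequence are hypothetical and are not taken from SEQ ID NO:60-118.

```python
# Watson-Crick pairing for DNA; an RNA equivalent would substitute U for T.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def probe_matches_target(probe: str, target: str, min_len: int = 20) -> bool:
    """True if the probe contains a run of at least min_len contiguous
    nucleotides that is perfectly complementary to some stretch of the target.

    Assumption: both sequences are DNA written 5' to 3' using A, C, G, T only.
    """
    probe_rc = reverse_complement(probe)
    target = target.upper()
    # Slide a window of min_len over the reverse-complemented probe and look
    # for that window in the target; longer complementary runs contain such
    # a window, so they are detected as well.
    return any(
        probe_rc[i:i + min_len] in target
        for i in range(len(probe_rc) - min_len + 1)
    )


if __name__ == "__main__":
    # Hypothetical 30-nucleotide target used only for illustration.
    target = "ATGGCGTACCTGAAGTTCGATCCGGAATTC"
    probe = reverse_complement(target[5:27])      # 22-nucleotide probe
    short_probe = reverse_complement(target[:10])  # too short to qualify
    print(probe_matches_target(probe, target))
    print(probe_matches_target(short_probe, target))
```

Passing a larger value of the illustrative `min_len` parameter, for example 30, 40, 60, 80, or 100, corresponds to the longer probe lengths recited above.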

Still yet another embodiment provides a method for detecting a target polynucleotide in a
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:60-118, b) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:60-118, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain
reaction amplification, and b) detecting the presence or absence of said amplified target polynucleotide
or fragment thereof. In a related embodiment, the method can include detecting the amount of the
amplified target polynucleotide or fragment thereof.

Another embodiment provides a composition comprising an effective amount of a polypeptide
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:1-59, b) a polypeptide comprising a naturally occurring
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59, c) a biologically active fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-59,
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-59, and a pharmaceutically acceptable excipient. In one
embodiment, the composition can comprise an amino acid sequence selected from the group consisting
of SEQ ID NO:1-59. Other embodiments provide a method of treating a disease or condition
associated with decreased or abnormal expression of functional TRICH, comprising administering to a
patient in need of such treatment the composition.

Yet another embodiment provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-59, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-59, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-59, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59. The method comprises a) exposing a sample
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another
embodiment provides a composition comprising an agonist compound identified by the method and a
pharmaceutically acceptable excipient. Yet another embodiment provides a method of treating a
disease or condition associated with decreased expression of functional TRICH, comprising
administering to a patient in need of such treatment the composition.

Still yet another embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:1-59, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-59, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:1-59, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-59. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 
the sample. Another embodiment provides a composition comprising an antagonist compound 
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment 
provides a method of treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds to 
a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO:1-59, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-59, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:1-59, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-59. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-59, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO:1-59, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-59, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-59. The method comprises a) combining the 
polypeptide with at least one test compound under conditions permissive for the activity of the 
polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c) 
comparing the activity of the polypeptide in the presence of the test compound with the activity of the 
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide in 
the presence of the test compound is indicative of a compound that modulates the activity of the 
polypeptide. 

Still yet another embodiment provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:60-118, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:60-118, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:60-118, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs 
under conditions whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting of 
i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:60-118, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical or at least about 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:60-118, iii) a polynucleotide complementary to the polynucleotide of i), iv) a 
polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). 
Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected from the 
group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing 
the amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in the amount of hybridization 
complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide 
embodiments of the invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog, and the PROTEOME database identification numbers and annotations of PROTEOME 
database homologs, for polypeptide embodiments of the invention. The probability scores for the 
matches between each polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide embodiments, including predicted motifs and 
domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 

Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
polypeptides, along with applicable descriptions, references, and threshold parameters. 

Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the terminology used herein is 
for the purpose of describing particular embodiments only, and is not intended to limit the scope of the 
invention. 

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one 
or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing the 
cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with various embodiments of the invention. Nothing herein is to be construed as an 
admission that the invention is not entitled to antedate such disclosure by virtue of prior invention. 

DEFINITIONS 

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TRICH either by directly interacting with 
TRICH or by acting on components of the biological pathway in which TRICH participates. 

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide encoding TRICH. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent TRICH. Deliberate 
amino acid substitutions may be made on the basis of one or more similarities in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological activity of TRICH is retained. For example, negatively charged amino 
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values 
may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side 
chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and 
alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" can refer to an oligopeptide, a peptide, a 
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid. Amplification 
may be carried out using polymerase chain reaction (PCR) technologies or other nucleic acid 
amplification technologies well known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of TRICH. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the activity of 
TRICH either by directly interacting with TRICH or by acting on components of the biological 
pathway in which TRICH participates. 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left-handed 
nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides 
having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or 
oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. 
Antisense molecules may be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 
naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either 
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, 
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3'-TCA-5'. 
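
By way of illustration only, the base-pairing relationship in the example above may be sketched in Python. The function names are hypothetical and form no part of the disclosed methods; only the four standard DNA bases are handled.

```python
# Watson-Crick pairing for the four standard DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3to5(seq_5to3: str) -> str:
    """Return the complementary strand written 3'->5', base by base."""
    return "".join(COMPLEMENT[base] for base in seq_5to3.upper())


def reverse_complement(seq_5to3: str) -> str:
    """Return the complementary strand written in the usual 5'->3' direction."""
    return complement_3to5(seq_5to3)[::-1]


def are_complementary(a_5to3: str, b_5to3: str) -> bool:
    """True if two strands, both written 5'->3', can anneal over their full length."""
    return reverse_complement(a_5to3) == b_5to3.upper()
```

Here `complement_3to5("AGT")` reproduces the 3'-TCA-5' strand of the example, while `reverse_complement` gives the same strand read 5' to 3'.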

A "composition comprising a given polynucleotide" and a "composition comprising a given 
polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding TRICH or fragments of TRICH may be employed as hybridization probes. 
The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as 
a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's 
solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVIEW fragment assembly system (Accelrys, 
Burlington MA) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 

Original Residue      Conservative Substitution 

Ala                   Gly, Ser 
Arg                   His, Lys 
Asn                   Asp, Gln, His 
Asp                   Asn, Glu 
Cys                   Ala, Ser 
Gln                   Asn, Glu, His 
Glu                   Asp, Gln, His 
Gly                   Ala 
His                   Asn, Arg, Gln, Glu 
Ile                   Leu, Val 
Leu                   Ile, Val 
Lys                   Arg, Gln, Glu 
Met                   Leu, Ile 
Phe                   His, Met, Leu, Trp, Tyr 
Ser                   Cys, Thr 
Thr                   Ser, Val 
Trp                   Phe, Tyr 
Tyr                   His, Phe, Trp 
Val                   Ile, Leu, Thr 

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of TRICH or a polynucleotide encoding TRICH which can 
be identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid 
residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, 
may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous 
nucleotides or amino acid residues in length. Fragments may be preferentially selected from certain 
regions of a molecule. For example, a polypeptide fragment may comprise a certain length of 
contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a 
polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and any 
length that is supported by the specification, including the Sequence Listing, tables, and figures, may be 
encompassed by the present embodiments. 

A fragment of SEQ ID NO:60-118 can comprise a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:60-118, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:60-118 can be employed 
in one or more embodiments of methods of the invention, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:60-118 from related 
polynucleotides. The precise length of a fragment of SEQ ID NO:60-118 and the region of SEQ ID 
NO:60-118 to which the fragment corresponds are routinely determinable by one of ordinary skill in 
the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:1-59 is encoded by a fragment of SEQ ID NO:60-118. A 
fragment of SEQ ID NO:1-59 can comprise a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:1-59. For example, a fragment of SEQ ID NO:1-59 can be used as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-59. 
The precise length of a fragment of SEQ ID NO:1-59 and the region of SEQ ID NO:1-59 to which 
the fragment corresponds can be determined based on the intended purpose for the fragment using 
one or more analytical methods described herein or otherwise known in the art. 

A "full length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
polynucleotide sequence encodes a "full length" polypeptide sequence. 
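
By way of illustration only, the minimum features of a "full length" coding region recited above may be checked with a short Python sketch. The function name is hypothetical, and the sketch assumes a DNA coding sequence already trimmed of untranslated regions, an ATG initiation codon, and the three standard termination codons; none of these assumptions limits the definition.

```python
# Standard termination codons, written as DNA (assumption: standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_full_length_cds(cds: str) -> bool:
    """Check for an initiation codon, an uninterrupted reading frame, and a final stop codon."""
    cds = cds.upper()
    # A start codon plus a stop codon need at least six bases, in whole codons.
    if len(cds) < 6 or len(cds) % 3 != 0:
        return False
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    starts_with_atg = codons[0] == "ATG"
    ends_with_stop = codons[-1] in STOP_CODONS
    # The open reading frame must not contain a premature stop codon.
    no_internal_stop = not any(c in STOP_CODONS for c in codons[1:-1])
    return starts_with_atg and ends_with_stop and no_internal_stop
```

A sequence such as ATGGCCTAA satisfies all three checks, whereas a sequence with an in-frame stop codon before the final codon does not.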

"Homology" refers to sequence similarity or, alternatively, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of identical nucleotide matches between at least two polynucleotide sequences aligned 
using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, 
gaps in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 
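
By way of illustration only, percent identity over an existing pairwise alignment may be computed as sketched below in Python. The function name, the use of "-" as the gap character, and the choice of the number of aligned columns as the denominator are assumptions made for this sketch; alignment programs differ in how they treat gapped positions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences have a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    # A column counts as a match only if the residues are identical and not gaps.
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)
```

For the alignment of ACGT-A with ACCTGA, four of six columns match, so the sketch returns about 66.7%.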

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 
can be determined using the default parameters of the CLUSTAL V algorithm as incorporated into 
the MEGALIGN version 3.12e sequence alignment program. This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989; CABIOS 5:151-153) 
and in Higgins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of 
polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, 
window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which 
is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 
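
By way of illustration only, the effect of codon degeneracy may be sketched in Python. The two short sequences below are hypothetical examples that do not correspond to any sequence in the Sequence Listing, and the codon table is a small excerpt of the standard genetic code sufficient for these examples.

```python
# Excerpt of the standard genetic code (DNA codons -> one-letter amino acids).
CODON_TABLE = {
    "ATG": "M",
    "CTT": "L", "TTG": "L",
    "TCA": "S", "AGC": "S",
    "CGT": "R", "AGA": "R",
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon using the excerpted table."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def nucleotide_identity(a: str, b: str) -> float:
    """Percent of positions with identical bases in two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


# Hypothetical sequences that use different codons for the same residues.
SEQ_A = "ATGCTTTCACGT"
SEQ_B = "ATGTTGAGCAGA"
```

Both hypothetical sequences translate to the same peptide (Met-Leu-Ser-Arg), although fewer than half of their nucleotide positions are identical.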

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. The 
phrases "percent similarity" and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not included in the calculation of percent identity between polypeptide 
sequences. 
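
By way of illustration only, the distinction between percent identity and percent similarity may be sketched in Python. The substitution sets are transcribed, in one-letter code, from the conservative substitution table in this section. The function names, the treatment of a pair as conservative when either residue lists the other, and the use of aligned columns as the denominator are assumptions made for this sketch.

```python
# Original residue -> residues regarded as conservative substitutions (one-letter codes).
CONSERVATIVE = {
    "A": "GS", "R": "HK", "N": "DQH", "D": "NE", "C": "AS",
    "Q": "NEH", "E": "DQH", "G": "A", "H": "NRQE", "I": "LV",
    "L": "IV", "K": "RQE", "M": "LI", "F": "HMLWY", "S": "CT",
    "T": "SV", "W": "FY", "Y": "HFW", "V": "ILT",
}


def is_conservative(x: str, y: str) -> bool:
    """True if either residue is listed as a conservative substitution for the other."""
    return y in CONSERVATIVE.get(x, "") or x in CONSERVATIVE.get(y, "")


def identity_and_similarity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (percent identity, percent similarity) over the aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = identical = similar = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # skip columns that are gaps in both sequences
        columns += 1
        if x == gap or y == gap:
            continue  # a gap opposite a residue is neither identical nor similar
        if x == y:
            identical += 1
            similar += 1  # identical residues also count toward similarity
        elif is_conservative(x, y):
            similar += 1
    if columns == 0:
        return 0.0, 0.0
    return 100.0 * identical / columns, 100.0 * similar / columns
```

For the hypothetical aligned peptides MKLV and MRIT, only the first residue is identical (25% identity), but the remaining three pairs are conservative substitutions (100% similarity).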

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. 

Alternatively, the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 µg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic 
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 
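
By way of illustration only, the selection of a wash temperature relative to the thermal melting point may be sketched in Python. The equation used is one commonly quoted empirical approximation for long DNA:DNA hybrids and is an assumption of this sketch, not an equation reproduced from this specification; coefficient values differ slightly between references. The function names and parameters are likewise hypothetical.

```python
import math


def estimate_tm_celsius(seq: str, na_molar: float, formamide_percent: float = 0.0) -> float:
    """Empirical melting point estimate for a long DNA:DNA hybrid (assumed formula).

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 600/N - 0.63*(% formamide)
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0 or na_molar <= 0:
        raise ValueError("sequence must be non-empty and [Na+] must be positive")
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    return (81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent
            - 600.0 / n - 0.63 * formamide_percent)


def wash_temperature_window(tm_celsius: float) -> tuple:
    """Wash temperatures about 5 to 20 degrees C below the melting point, as (low, high)."""
    return (tm_celsius - 20.0, tm_celsius - 5.0)
```

Under this approximation, lowering the salt concentration or adding formamide lowers the estimated melting point, which is consistent with the use of low-salt washes to increase stringency.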

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex
may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid present in
solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, chips,
pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been
fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively.

"Immune response" can refer to conditions associated with inflammation, trauma, immune
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment
of TRICH which is useful in any of the antibody production methods disclosed herein or known in the
art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, antibodies, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or
other chemical compound having a unique and defined position on a microarray.

The term "modulate" refers to a change in the activity of TRICH. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological,
functional, or immunological properties of TRICH.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide,
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell.

"Post-translational modification" of a TRICH may involve lipidation, glycosylation,
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary
by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acids encoding TRICH, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated oligonucleotides 
or polynucleotides attached to a detectable label or reporter molecule. Typical labels include
radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short nucleic
acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic 
acid, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100,
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers
may be considerably longer than these examples, and it is understood that any length supported by the
specification, including the tables, figures, and Sequence Listing, may be used.
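
As an illustration of the length requirement, the following sketch enumerates every contiguous window of a chosen length from a known sequence. It is a minimal example only: the function name and the 20-nucleotide input are hypothetical, and no scoring of specificity or melting behavior is attempted.

```python
MIN_PROBE_LENGTH = 15  # minimum contiguous length stated in the definition above


def candidate_probes(sequence: str, length: int = MIN_PROBE_LENGTH) -> list[str]:
    """Return every contiguous window of `length` nucleotides from `sequence`."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probes are assumed to span at least 15 contiguous nucleotides")
    seq = sequence.upper()
    # range() is empty when the sequence is shorter than the window, giving [].
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


if __name__ == "__main__":
    known = "ATGGCCAAGCTTGGATCCGA"  # hypothetical 20-nt sequence
    for probe in candidate_probes(known):
        print(probe)
```

Longer windows (20, 25, 30 nucleotides, and so on) can be requested through the `length` argument when greater specificity is wanted.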

Methods for preparing and using probes and primers are described in, for example, Sambrook,
J. and D.W. Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, Cold Spring
Harbor Press, Cold Spring Harbor NY), Ausubel, F.M. et al. (1999; Short Protocols in Molecular
Biology, 4th ed., John Wiley & Sons, New York NY), and Innis, M. et al. (1990; PCR Protocols, A
Guide to Methods and Applications, Academic Press, San Diego CA). PCR primer pairs can be
derived from a known sequence, for example, by using computer programs intended for that purpose
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).

Oligonucleotides for use as primers are selected using software known in the art for such
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection
programs have incorporated additional features for expanded capabilities. For example, the PrimOU
primer selection program (available to the public from the Genome Center at University of Texas
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer
selection program (available to the public from the Whitehead Institute/MIT Center for Genome
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may
also be obtained from their respective sources and modified to meet the user's specific needs.) The
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing
selection of primers that hybridize to either the most conserved or least conserved regions of aligned
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments
identified by any of the above selection methods are useful in hybridization technologies, for example,
as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are
not limited to those described above.

A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly,
by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering
techniques such as those described in Sambrook and Russell (supra). The term recombinant includes
nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the
nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably
linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used,
for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions
(UTRs). Regulatory elements interact with host or viral proteins which control transcription,
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art.

An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 
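
At the level of the written sequence, the RNA equivalent amounts to a base substitution, which can be sketched as follows. The function name and the example sequence are hypothetical; the change of sugar backbone is a chemical property and is not represented in the text string.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence by replacing thymine with uracil."""
    if not set(dna) <= set("ACGTacgt"):
        raise ValueError("expected a DNA sequence containing only A, C, G, and T")
    # Same linear order of nucleotides; only T/t is rewritten as U/u.
    return dna.replace("T", "U").replace("t", "u")


if __name__ == "__main__":
    print(rna_equivalent("ATGCTT"))
```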

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH,
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular structure
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the
epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the
antibody will reduce the amount of labeled A that binds to the antibody.

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60%
free, preferably at least about 75% free, and most preferably at least about 90% free from other
components with which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene expression 
by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient
cell. Transformation may occur under natural or artificial conditions according to various methods
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 
infection, electroporation, heat shock, lipofection, and particle bombardment. The term "transformed 
cells" includes stably transformed cells in which the inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art. The
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell,
by way of deliberate genetic manipulation, such as by microinjection or by infection with a
recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a
recombinant viral vector, such as a lentiviral vector (Lois, C. et al. (2002) Science 295:868-872). The 
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather 
is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugation. Techniques
for transferring the DNA of the present invention into such organisms are widely known and provided
in references such as Sambrook and Russell (supra). 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have
significant identity to a reference molecule, but will generally have a greater or lesser number of
polynucleotides due to alternate splicing during mRNA processing. The corresponding polypeptide
may possess additional functional domains or lack domains that are present in the reference molecule.
Species variants are polynucleotides that vary from one species to another. The resulting polypeptides
will generally have significant amino acid identity relative to each other. A polymorphic variant is a
variation in the polynucleotide sequence of a particular gene between individuals of a given species.
Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the
polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of,
for example, a certain population, a disease state, or a propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having
at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a
certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%,
or at least 99% or greater sequence identity or sequence similarity over a certain defined length of one
of the polypeptides.
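
The 40% threshold in the two variant definitions can be illustrated with a simplified calculation. The sketch below does not run blastn or blastp; it assumes the two sequences have already been aligned, with "-" marking gaps, and it counts identity over the aligned columns. The function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True when identity meets the minimum threshold used in the definitions above."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))
    print(is_variant("AAAAAAAAAA", "AAACCCCCCC"))
```

Because BLAST reports identity over the local alignment it finds, values from this simplified count may differ from those obtained with the "BLAST 2 Sequences" tool.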

THE INVENTION 

Various embodiments of the invention include new human transporters and ion channels
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis,
treatment, or prevention of transport, neurological, muscle, immunological and cell proliferative
disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide
sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID
NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown.
Column 6 shows the Incyte ID numbers of physical, full length clones corresponding to the polypeptide 
and polynucleotide sequences of the invention. The full length clones encode polypeptides which have 
at least 95% sequence identity to the polypeptide sequences shown in column 3. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as 
identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME 
database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 
polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID NO:) 
of the nearest GenBank homolog and the PROTEOME database identification numbers 
(PROTEOME ID NO:) of the nearest PROTEOME database homologs. Column 4 shows the 
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows the 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations where 
applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column
3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS
program of the GCG sequence analysis software package (Accelrys, Burlington MA). Column 6 
shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 shows 
analytical methods for protein structure/function analysis and in some cases, searchable databases to 
which the analytical methods were applied.

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these
properties establish that the claimed polypeptides are transporters and ion channels. For example,
SEQ ID NO:7 is 99% identical, from residue M1 to residue E300, to human acetylcholine receptor
beta-subunit preprotein (GenBank ID g560155) as determined by the Basic Local Alignment Search
Tool (BLAST). (See Table 2.) The BLAST probability score is 1.9e-199, which indicates the
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO:7 also
has homology to the cholinergic receptor (nicotinic) beta 1 subunit, as determined by BLAST analysis
using the PROTEOME database. SEQ ID NO:7 also contains a neurotransmitter-gated ion-channel
ligand binding domain and a neurotransmitter-gated ion-channel transmembrane region as determined
by searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM
database of conserved protein families/domains, and a cation transporter family protein domain as
determined by searching for statistically significant matches in the hidden Markov model (HMM)-
based TIGRFAM database of conserved protein families/domains. (See Table 3.) Data from
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ
ID NO:7 is a cholinergic receptor subunit. In an alternative example, SEQ ID NO:41 is 97% identical,
from residue M1 to residue T241, to human gamma-aminobutyric acid A receptor alpha 2 subunit
(GenBank ID g386422) as determined by the Basic Local Alignment Search Tool (BLAST). (See
Table 2.) The BLAST probability score is 5.4e-215, which indicates the probability of obtaining the
observed polypeptide sequence alignment by chance. SEQ ID NO:41 also has homology to the alpha
2 subunit of the GABA-A receptor, a chloride channel that is the major inhibitory neurotransmitter
receptor in the brain, as determined by BLAST analysis using the PROTEOME database. SEQ ID
NO:41 also contains a neurotransmitter-gated ion channel ligand binding domain and a
neurotransmitter-gated ion channel transmembrane domain, as well as a cation transporter family
protein domain, as determined by searching for statistically significant matches in the hidden Markov
model (HMM)-based PFAM and TIGRFAM databases of conserved protein families/domains. (See
Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses, and BLAST analyses
against the PRODOM and DOMO databases, provide further corroborative evidence that SEQ ID
NO:41 is a GABA receptor. SEQ ID NO:1-6, SEQ ID NO:8-40, and SEQ ID NO:42-59 were
analyzed and annotated in a similar manner. The algorithms and parameters for the analysis of SEQ
ID NO:1-59 are described in Table 7.

As shown in Table 4, the full length polynucleotide embodiments were assembled using cDNA 
sequences or coding (exon) sequences derived from genomic DNA, or any combination of these two
types of sequences. Column 1 lists the polynucleotide sequence identification number (Polynucleotide
SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number (Incyte ID) for
each polynucleotide of the invention, and the length of each polynucleotide sequence in basepairs.
Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or genomic
sequences used to assemble the full length polynucleotide embodiments, and of fragments of the
polynucleotides which are useful, for example, in hybridization or amplification technologies that
identify SEQ ID NO:60-118 or that distinguish between SEQ ID NO:60-118 and related
polynucleotides. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank
cDNAs or ESTs which contributed to the assembly of the full length polynucleotides. In addition, the
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL
(The Sanger Centre, Cambridge, UK) database (i.e., those sequences including the designation
"ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from the
NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including the
designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those sequences
including the designation "NP"). Alternatively, the polynucleotide fragments described in column 2
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon
stitching" algorithm. For example, a polynucleotide sequence identified as
FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched" sequence in which XXXXXX is the
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is the
number of the prediction generated by the algorithm, and N1,2,3..., if present, represent specific exons
that may have been manually edited during analysis (See Example V). Alternatively, the
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an
"exon-stretching" algorithm. For example, a polynucleotide sequence identified as
FLXXXXXX_gAAAAA_gBBBBB_1_N is a "stretched" sequence, with XXXXXX being the Incyte
project identification number, gAAAAA being the GenBank identification number of the human
genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the GenBank
identification number or NCBI RefSeq identification number of the nearest GenBank protein homolog,
and N referring to specific exons (See Example V). In instances where a RefSeq sequence was used
as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM,"
"NP," or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB).

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis
methods associated with the prefixes (see Example IV and Example V). 



Prefix            Type of analysis and/or examples of programs

GNN, GFG, ENST    Exon prediction from genomic sequences using, for example,
                  GENSCAN (Stanford University, CA, USA) or FGENES
                  (Computer Genomics Group, The Sanger Centre, Cambridge, UK).

GBI               Hand-edited analysis of genomic sequences.

FL                Stitched or stretched genomic sequences (see Example V).

INCY              Full length transcript and exon prediction from mapping of EST
                  sequences to the genome. Genomic location and EST composition
                  data are combined to predict the exons and resulting transcript.


In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in
Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte
cDNA identification numbers are not shown.

Table 5 shows the representative cDNA libraries for those full length polynucleotides which
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte
cDNA library which is most frequently represented by the Incyte cDNA sequences which were used
to assemble and confirm the above polynucleotides. The tissues and vectors which were used to
construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the invention, along with allele frequencies in different human populations. Columns 1 and 2 show the 
polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte project 
identification number (PID) for polynucleotides of the invention. Column 3 shows the Incyte
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence
at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the full-
length polynucleotide sequence (CB1 SNP). Column 7 shows the allele found in the EST sequence.
Columns 8 and 9 show the two alleles found at the SNP site. Column 10 shows the amino acid
encoded by the codon including the SNP site, based upon the allele found in the EST. Columns 11-14 
show the frequency of allele 1 in four different human populations. An entry of n/d (not detected) 
indicates that the frequency of allele 1 in the population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for the population. 
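
The distinction between the two non-numeric entries can be made explicit when the allele frequency columns are read programmatically. The sketch below is illustrative only: the function name is hypothetical, and it assumes that measured frequencies are written as decimal fractions between 0 and 1, which this passage does not state.

```python
from typing import Optional


def parse_allele_frequency(entry: str) -> tuple[str, Optional[float]]:
    """Interpret one allele 1 frequency entry from columns 11-14 of Table 8."""
    text = entry.strip().lower()
    if text == "n/d":
        # Frequency of allele 1 was too low to be detected in the population.
        return "not detected", None
    if text == "n/a":
        # Frequency was not determined for the population.
        return "not available", None
    value = float(text)  # assumed format: decimal fraction
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"frequency out of range: {entry!r}")
    return "measured", value


if __name__ == "__main__":
    for cell in ("0.25", "n/d", "n/a"):
        print(cell, "->", parse_allele_frequency(cell))
```

Keeping "not detected" and "not available" as separate states avoids treating an undetermined frequency as a very low one.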

The invention also encompasses TRICH variants. Various embodiments of TRICH variants 
can have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity 
to the TRICH amino acid sequence, and can contain at least one functional or structural characteristic 
of TRICH. 

Various embodiments also encompass polynucleotides which encode TRICH. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:60-118, which encodes TRICH. The polynucleotide
sequences of SEQ ID NO:60-118, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses variants of a polynucleotide encoding TRICH. In particular, 
such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or even 
at least about 95% polynucleotide sequence identity to a polynucleotide encoding TRICH. A 
particular aspect of the invention encompasses a variant of a polynucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:60-118 which has at least about 70%, or
alternatively at least about 85%, or even at least about 95% polynucleotide sequence identity to a
nucleic acid sequence selected from the group consisting of SEQ ID NO:60-118. Any one of the
polynucleotide variants described above can encode a polypeptide which contains at least one 
functional or structural characteristic of TRICH. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant of a 
polynucleotide encoding TRICH. A splice variant may have portions which have significant sequence 
identity to a polynucleotide encoding TRICH but will generally have a greater or lesser number of 
polynucleotides due to additions or deletions of blocks of sequence arising from alternate splicing 
during mRNA processing. A splice variant may have less than about 70%, or alternatively less than 
about 60%, or alternatively less than about 50% polynucleotide sequence identity to a polynucleotide 
encoding TRICH over its entire length; however, portions of the splice variant will have at least about 
70%, or alternatively at least about 85%, or alternatively at least about 95%, or alternatively 100%
polynucleotide sequence identity to portions of the polynucleotide encoding TRICH. For example, a
polynucleotide comprising a sequence of SEQ ID NO:63 is a splice variant of a polynucleotide 
comprising a sequence of SEQ ID NO:66; and a polynucleotide comprising a sequence of SEQ ID 
NO:64 is a splice variant of a polynucleotide comprising a sequence of SEQ ID NO:68. In an
alternative example, a polynucleotide comprising a sequence of SEQ ID NO:97, a polynucleotide
comprising a sequence of SEQ ID NO:98, a polynucleotide comprising a sequence of SEQ ID NO:99,
a polynucleotide comprising a sequence of SEQ ID NO:100, a polynucleotide comprising a sequence
of SEQ ID NO:101, a polynucleotide comprising a sequence of SEQ ID NO:102, and a polynucleotide
comprising a sequence of SEQ ID NO:114 are all splice variants of each other. In a further example,
a polynucleotide comprising a sequence of SEQ ID NO:93 is a splice variant of a polynucleotide
comprising a sequence of SEQ ID NO:94, a polynucleotide comprising a sequence of SEQ ID NO:106
is a splice variant of a polynucleotide comprising a sequence of SEQ ID NO:107, and a polynucleotide
comprising a sequence of SEQ ID NO:116 is a splice variant of a polynucleotide comprising a
sequence of SEQ ID NO:117. In addition, a polynucleotide comprising a sequence of SEQ ID NO:60
is a splice variant of a polynucleotide comprising a sequence of SEQ ID NO:79, a polynucleotide
comprising a sequence of SEQ ID NO:67 is a splice variant of a polynucleotide comprising a sequence
of SEQ ID NO:84, a polynucleotide comprising a sequence of SEQ ID NO:71 is a splice variant of a
polynucleotide comprising a sequence of SEQ ID NO:75, and a polynucleotide comprising a sequence
of SEQ ID NO:73 is a splice variant of a polynucleotide comprising a sequence of SEQ ID NO:81.
Any one of the splice variants described above can encode a polypeptide which contains at least one
functional or structural characteristic of TRICH.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered as 
being specifically disclosed. 
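
To give a sense of scale for the "multitude" of sequences that codon degeneracy permits, the following Python sketch counts the distinct coding sequences for a given peptide under the standard genetic code. The function name and the example peptide are illustrative assumptions; the codon counts per amino acid are those of the standard code (61 sense codons and 3 stop codons).

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}
STOP_CODONS = 3  # TAA, TAG, TGA


def coding_sequence_count(peptide: str, include_stop: bool = False) -> int:
    """Count the distinct nucleotide sequences that encode the given peptide."""
    count = prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())
    return count * STOP_CODONS if include_stop else count


# Hypothetical tripeptide Met-Lys-Leu: 1 * 2 * 6 choices.
print(coding_sequence_count("MKL"))
```

Because the count is a product over residues, it grows roughly exponentially with peptide length, which is why the variations are described by rule rather than enumerated.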

Although polynucleotides which encode TRICH and its variants are generally capable of
hybridizing to polynucleotides encoding naturally occurring TRICH under appropriately selected
conditions of stringency, it may be advantageous to produce polynucleotides encoding TRICH or its
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence 
encoding TRICH and its derivatives without altering the encoded amino acid sequences include the
production of RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 
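
One simple way to select codons according to host usage frequency is sketched below in Python. The usage table is explicitly hypothetical: its values are invented for illustration and are not measured data for any organism. Choosing the single most frequent codon at every position is only one possible strategy and is not presented here as the method of the invention.

```python
# HYPOTHETICAL relative codon frequencies for an unnamed host; not measured data.
HYPOTHETICAL_HOST_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13, "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "W": {"TGG": 1.00},
}


def back_translate(peptide: str, usage: dict) -> str:
    """Encode a peptide using the most frequent codon for each residue in `usage`."""
    codons = []
    for residue in peptide.upper():
        table = usage[residue]  # raises KeyError for residues missing from the table
        codons.append(max(table, key=table.get))
    return "".join(codons)


# Hypothetical peptide Met-Lys-Leu-Trp.
print(back_translate("MKLW", HYPOTHETICAL_HOST_USAGE))
```

The encoded amino acid sequence is unchanged by this substitution; only the nucleotide sequence differs.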

The invention also encompasses production of polynucleotides which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic polynucleotide may be inserted into any of the many available expression vectors and cell
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a polynucleotide encoding TRICH or any fragment thereof. 

Embodiments of the invention can also include polynucleotides that are capable of hybridizing 
to the claimed polynucleotides, and, in particular, to those having the sequences shown in SEQ ID
NO:60-118 and fragments thereof, under various conditions of stringency (Wahl, G.M. and S.L.
Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511).
Hybridization conditions, including annealing and wash conditions, are described in "Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied
Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines
such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler
(MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems).
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other
systems known in the art. The resulting sequences are analyzed using a variety of algorithms which
are well known in the art (Ausubel et al., supra, ch. 7; Meyers, R.A. (1995) Molecular Biology and
Biotechnology, Wiley VCH, New York NY, pp. 856-853).

The nucleic acids encoding TRICH may be extended utilizing a partial nucleotide sequence 
and employing various PCR-based methods known in the art to detect upstream sequences, such as
promoters and regulatory elements. For example, one method which may be employed, restriction-site
PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a 
cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, inverse PCR, 
uses primers that extend in divergent directions to amplify unknown sequence from a circularized 
template. The template is derived from restriction fragments comprising a known genomic locus and
surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). A third method, capture 
PCR, involves PCR amplification of DNA fragments adjacent to known sequences in human and 
yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119). In 
this method, multiple restriction enzyme digestions and ligations may be used to insert an engineered 
double-stranded sequence into a region of unknown sequence before performing PCR. Other
methods which may be used to retrieve unknown sequences are known in the art (Parker, J.D. et al.
(1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and
PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based
methods, primers may be designed using commercially available software, such as OLIGO 4.06
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the
template at temperatures of about 68°C to 72°C.
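
The length and GC-content criteria for primers stated above can be checked with a short Python sketch such as the following. The function names and default parameter names are illustrative assumptions, the "about" qualifiers are treated as hard cutoffs for simplicity, and the annealing-temperature criterion is deliberately not modeled because melting-temperature formulas vary between programs.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a non-empty primer sequence."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def meets_primer_criteria(
    primer: str,
    min_len: int = 22,
    max_len: int = 30,
    min_gc: float = 0.50,
) -> bool:
    """Check primer length (22 to 30 nt) and GC content (50% or more)."""
    if not primer or set(primer.upper()) - set("ACGT"):
        return False  # empty input or non-ACGT characters
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Hypothetical 24-nucleotide candidate with 12 G/C bases.
print(meets_primer_criteria("GCGCATATGCGCATATGCGCATAT"))
```

A candidate that passes this filter would still need to be evaluated for annealing temperature, self-complementarity, and specificity with an appropriate primer analysis program.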

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotides or fragments thereof which encode
TRICH may be cloned in recombinant DNA molecules that direct expression of TRICH, or fragments 
or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other polynucleotides which encode substantially the same or a functionally equivalent
polypeptide may be produced and used to express TRICH.

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter TRICH-encoding sequences for a variety of purposes including, but not limited to, 
modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by
random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 
the biological properties of TRICH, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, polynucleotides encoding TRICH may be synthesized, in whole or in 
part, using one or more chemical methods well known in the art (Caruthers, M.H. et al. (1980) 
Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232). 
Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical methods known 
in the art. For example, peptide synthesis can be performed using various solution-phase or
solid-phase techniques (Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH
Freeman, New York NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269:202-204). Automated
synthesis may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems).
Additionally, the amino acid sequence of TRICH, or any part thereof, may be altered during direct
synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a
variant polypeptide or a polypeptide having a sequence of a naturally occurring polypeptide.

The peptide may be substantially purified by preparative high performance liquid 
chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The 
composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing
(Creighton, supra, pp. 28-53).

In order to express a biologically active TRICH, the polynucleotides encoding TRICH or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in 
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding
TRICH. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of polynucleotides encoding TRICH. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where a 
polynucleotide sequence encoding TRICH and its initiation codon and upstream regulatory sequences
are inserted into the appropriate expression vector, no additional transcriptional or translational control
signals may be needed. However, in cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals including an in-frame ATG initiation codon should be 
provided by the vector. Exogenous translational elements and initiation codons may be of various 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of
enhancers appropriate for the particular host cell system used (Scharf, D. et al. (1994) Results Probl.
Cell Differ. 20:125-162). 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing polynucleotides encoding TRICH and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques,
and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et al.,
supra, ch. 1, 3, and 15).

A variety of expression vector/host systems may be utilized to contain and express
polynucleotides encoding TRICH. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g.,
cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors
(e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et al.,
supra; Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al.
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945; Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and
Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659; Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355).
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from 
various bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, or
cell population (Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, M. et al. (1993) Proc.
Natl. Acad. Sci. USA 90:6340-6344; Buller, R.M. et al. (1985) Nature 317:813-815; McGregor, D.P.
et al. (1994) Mol. Immunol. 31:219-226; Verma, I.M. and N. Somia (1997) Nature 389:239-242). The
invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotides encoding TRICH. For example, routine cloning, subcloning,
and propagation of polynucleotides encoding TRICH can be achieved using a multifunctional E. coli 
vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid (Invitrogen). 
Ligation of polynucleotides encoding TRICH into the vector's multiple cloning site disrupts the lacZ 
gene, allowing a colorimetric screening procedure for identification of transformed bacteria containing
recombinant molecules. In addition, these vectors may be useful for in vitro transcription, dideoxy
sequencing, single strand rescue with helper phage, and creation of nested deletions in the cloned 
sequence (Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509). When large 
quantities of TRICH are needed, e.g. for the production of antibodies, vectors which direct high level 
expression of TRICH may be used. For example, vectors containing the strong, inducible SP6 or T7
bacteriophage promoter may be used.

Yeast expression systems may be used for production of TRICH. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign polynucleotide sequences into the host genome for stable propagation (Ausubel et al., 
supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184). 

Plant systems may also be used for expression of TRICH. Transcription of polynucleotides 
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs 
can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection 
(The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp.
191-196). 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding TRICH may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses TRICH in host cells (Logan, J. and T. Shenk (1984) Proc. Natl. Acad. 
Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) 
enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based vectors 
may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355). 

For long term production of recombinant proteins in mammalian systems, stable expression of 
TRICH in cell lines is preferred. For example, polynucleotides encoding TRICH can be transformed 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched 
media before being switched to selective media. The purpose of the selectable marker is to confer
resistance to a selective agent, and its presence allows growth and recovery of cells which
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk⁻ and apr⁻ cells, respectively (Wigler, M. et al. (1977)
Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or herbicide
resistance can be used as the basis for selection. For example, dhfr confers resistance to
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Wigler, M. et al.
(1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol.
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular
requirements for metabolites (Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA
85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995)
Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that the gene of interest
is also present, the presence and expression of the gene may need to be confirmed. For example, if
the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells
containing polynucleotides encoding TRICH can be identified by the absence of marker gene function.
Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the
control of a single promoter. Expression of the marker gene in response to induction or selection
usually indicates expression of the tandem gene as well.

In general, host cells that contain the polynucleotide encoding TRICH and that express 
TRICH may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of TRICH using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art
(Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect.
IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and
Wiley-Interscience, New York NY; Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa
NJ).

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization
or PCR probes for detecting sequences related to polynucleotides encoding TRICH include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, polynucleotides encoding TRICH, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety
of commercially available kits, such as those provided by Amersham Biosciences, Promega (Madison 
WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of 
detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as
well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with polynucleotides encoding TRICH may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
polynucleotides which encode TRICH may be designed to contain signal sequences which direct
secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides 
encoding TRICH may be ligated to a heterologous sequence resulting in translation of a fusion protein
in any of the aforementioned host systems. For example, a chimeric TRICH protein containing a
heterologous moiety that can be recognized by a commercially available antibody may facilitate the
screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein and peptide
moieties may also facilitate purification of fusion proteins using commercially available affinity
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins,
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site
located between the TRICH encoding sequence and the heterologous protein sequence, so that
TRICH may be cleaved away from the heterologous moiety following purification. Methods for
fusion protein expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16). A
variety of commercially available kits may also be used to facilitate expression and purification of
fusion proteins. 

In another embodiment, synthesis of radiolabeled TRICH may be achieved in vitro using the 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6
promoter. Translation takes place in the presence of a radiolabeled amino acid precursor, for
example, ³⁵S-methionine.

TRICH, fragments of TRICH, or variants of TRICH may be used to screen for compounds
that specifically bind to TRICH. One or more test compounds may be screened for specific binding to
TRICH. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test compounds can be screened
for specific binding to TRICH. Examples of test compounds can include antibodies, anticalins,
oligonucleotides, proteins (e.g., ligands or receptors), or small molecules.

In related embodiments, variants of TRICH can be used to screen for binding of test
compounds, such as antibodies, to TRICH, a variant of TRICH, or a combination of TRICH and/or 
one or more variants of TRICH. In an embodiment, a variant of TRICH can be used to screen for
compounds that bind to a variant of TRICH, but not to TRICH having the exact sequence of a
sequence of SEQ ID NO:1-59. TRICH variants used to perform such screening can have a range of
about 50% to about 99% sequence identity to TRICH, with various embodiments having 60%, 70%, 
75%, 80%, 85%, 90%, and 95% sequence identity. 

In an embodiment, a compound identified in a screen for specific binding to TRICH can be 
closely related to the natural ligand of TRICH, e.g., a ligand or fragment thereof, a natural substrate, a
structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current
Protocols in Immunology 1(2):Chapter 5). In another embodiment, the compound thus identified can
be a natural ligand of a receptor TRICH (Howard, A.D. et al. (2001) Trends Pharmacol. Sci.
22:132-140; Wise, A. et al. (2002) Drug Discovery Today 7:235-246).

In other embodiments, a compound identified in a screen for specific binding to TRICH can be
closely related to the natural receptor to which TRICH binds, at least a fragment of the receptor, or a
fragment of the receptor including all or a portion of the ligand binding site or binding pocket. For
example, the compound may be a receptor for TRICH which is capable of propagating a signal, or a
decoy receptor for TRICH which is not capable of propagating a signal (Ashkenazi, A. and V.M.
Dixit (1999) Curr. Opin. Cell Biol. 11:255-260; Mantovani, A. et al. (2001) Trends Immunol.
22:328-336). The compound can be rationally designed using known techniques. Examples of such
techniques include those used to construct the compound etanercept (ENBREL; Amgen Inc.,
Thousand Oaks CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is
an engineered p75 tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human
IgG1 (Taylor, P.C. et al. (2001) Curr. Opin. Immunol. 13:611-616).

In one embodiment, two or more antibodies having similar or, alternatively, different
specificities can be screened for specific binding to TRICH, fragments of TRICH, or variants of
TRICH. The binding specificity of the antibodies thus screened can thereby be selected to identify
particular fragments or variants of TRICH. In one embodiment, an antibody can be selected such that
its binding specificity allows for preferential identification of specific fragments or variants of TRICH.
In another embodiment, an antibody can be selected such that its binding specificity allows for
preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise
abnormal production of TRICH.

TRICH, fragments of TRICH, or variants of TRICH may be used to screen for compounds
that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial or
inverse agonists. In one embodiment, an assay is performed under conditions permissive for TRICH
activity, wherein TRICH is combined with at least one test compound, and the activity of TRICH in
the presence of a test compound is compared with the activity of TRICH in the absence of the test
compound. A change in the activity of TRICH in the presence of the test compound is indicative of a
compound that modulates the activity of TRICH. Alternatively, a test compound is combined with an
in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and the
assay is performed. In either of these assays, a test compound which modulates the activity of
TRICH may do so indirectly and need not come in direct contact with TRICH. At least
one and up to a plurality of test compounds may be screened.
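
The comparison at the heart of this assay, activity in the presence versus the absence of a test compound, can be sketched as follows. The function name, the arbitrary activity units, and the 20% threshold for calling a change are all assumptions introduced for illustration; the specification does not state a cutoff, and a real screen would rely on replicate measurements and a statistical test.

```python
def classify_modulation(activity_with: float,
                        activity_without: float,
                        threshold: float = 0.2) -> str:
    """Classify a test compound by the fractional change in measured activity.

    `threshold` is a hypothetical cutoff (20% by default), not a value
    taken from the specification.
    """
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    change = (activity_with - activity_without) / activity_without
    if change > threshold:
        return "increases activity"
    if change < -threshold:
        return "decreases activity"
    return "no change"


# Hypothetical readings in arbitrary units: (with compound, without compound).
readings = {"compound A": (150.0, 100.0),
            "compound B": (60.0, 100.0),
            "compound C": (105.0, 100.0)}
for name, (with_cmpd, without_cmpd) in readings.items():
    print(name, classify_modulation(with_cmpd, without_cmpd))
```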

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES)
cells. Such techniques are well known in the art and are useful for the generation of animal models of
human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337). For example,
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science
244:1288-1292). The vector integrates into the corresponding region of the host genome by
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP
system to knock out a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D.
(1996) J. Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330).
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and
the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous
strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents.

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al.
(1998) Science 282:1145-1147).

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region
of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected sequence
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively,
a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also serve as a
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74).
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between
regions of TRICH and transporters and ion channels. In addition, examples of tissues expressing
TRICH can be found in Table 6 and can also be found in Example XI. Therefore, TRICH appears to
play a role in transport, neurological, muscle, immunological and cell proliferative disorders. In the
treatment of disorders associated with increased TRICH expression or activity, it is desirable to
decrease the expression or activity of TRICH. In the treatment of disorders associated with
decreased TRICH expression or activity, it is desirable to increase the expression or activity of
TRICH.

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular
dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, diabetic
neuropathy, Duchenne muscular dystrophy, hypokalemic periodic paralysis, normokalemic periodic
paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis,
myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrhythmia,
tachyarrhythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline myopathy,
centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, ethanol
myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis, neurological
disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, dementia,
depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other disorders
associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy,
sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery stenosis,
sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Graves' disease, goiter, Cushing's
disease, Addison's disease, glucose-galactose malabsorption syndrome, glycogen storage disease,
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn
syndrome, von Gierke disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria,
iminoglycinuria, Hartnup disease, Fanconi disease, and Bartter syndrome; a neurological disorder such
as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders,
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy,
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation
and other developmental disorders of the central nervous system including Down syndrome, cerebral
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders,
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders,
seasonal affective disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy, hemiplegic
migraine, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder,
progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a
muscle disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's
muscular dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear
myopathy, lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis,
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock,
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia,
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy,
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid maltase
deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia congenita; an
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma,
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis,
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's
syndrome, rheumatoid arthritis, scleroderma, Sjögren's syndrome, systemic anaphylaxis, systemic
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial,
fungal, parasitic, protozoal, and helminthic infections, and trauma; and a cell proliferative disorder such
as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective
tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera,
psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma,
melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland,
bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal tract,
heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin,
spleen, testis, thymus, thyroid, and uterus.

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of TRICH including, but not limited to,
those provided above. 

In still another embodiment, an agonist which modulates the activity of TRICH may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or
prevent a disorder associated with increased expression or activity of TRICH. Examples of such
disorders include, but are not limited to, those transport, neurological, muscle, immunological and cell
proliferative disorders described above. In one aspect, an antibody which specifically binds TRICH
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a
pharmaceutical agent to cells or tissues which express TRICH.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding TRICH may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of TRICH including, but not limited to, those described above.

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, or
vector embodiments may be administered in combination with other appropriate therapeutic agents.
Selection of the appropriate agents for use in combination therapy may be made by one of ordinary
skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic
agents may act synergistically to effect the treatment or prevention of the various disorders described
above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of
each agent, thus reducing the potential for adverse side effects.

An antagonist of TRICH may be produced using methods which are generally known in the
art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may
also be generated using methods that are well known in the art. Such antibodies may include, but are
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and
fragments produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e., those
which inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from camels
or llamas) may be potent enzyme inhibitors and may have application in the design of peptide mimetics,
and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. Biotechnol.
74:277-302).

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels,
dromedaries, llamas, humans, and others may be immunized by injection with TRICH or with any
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species,
various adjuvants may be used to increase immunological response. Such adjuvants include, but are
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are
especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are substantially identical to a portion of the amino acid sequence of the natural protein.
Short stretches of TRICH amino acids may be fused with those of another protein, such as KLH, and
antibodies to the chimeric molecule may be produced.

Monoclonal antibodies to TRICH may be prepared using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but are not limited
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique (Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods
81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; Cole, S.P. et al. (1984)
Mol. Cell Biol. 62:109-120).

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad.
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; Takeda, S. et al. (1985)
Nature 314:452-454). Alternatively, techniques described for the production of single chain antibodies
may be adapted, using methods known in the art, to produce TRICH-specific single chain antibodies.
Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain
shuffling from random combinatorial immunoglobulin libraries (Burton, D.R. (1991) Proc. Natl. Acad.
Sci. USA 88:10134-10137).

Antibodies may also be produced by inducing in vivo production in the lymphocyte population
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in
the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, G. et al.
(1991) Nature 349:293-299).

Antibody fragments which contain specific binding sites for TRICH may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and
easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989)
Science 246:1275-1281).

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between TRICH and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive
to two non-interfering TRICH epitopes is generally used, but a competitive binding assay may also be
employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques
may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association
constant, Ka, which is defined as the molar concentration of TRICH-antibody complex divided by the
product of the molar concentrations of free antigen and free antibody under equilibrium conditions.
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies for
TRICH. The Ka determined for a preparation of monoclonal antibodies, which are monospecific for a
particular TRICH epitope, represents a true measure of affinity. High-affinity antibody preparations
with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in which the
TRICH-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations
with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in immunopurification and similar
procedures which ultimately require dissociation of TRICH, preferably in active form, from the
antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach, IRL Press, Washington DC;
Liddell, J.E. and A. Cryer (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons,
New York NY).
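
The definition of Ka given above, and its estimation by Scatchard analysis, can be illustrated with a short calculation. This is a minimal sketch under stated assumptions: a single class of binding sites (so that the plot of bound/free against bound is a straight line with slope -Ka), concentrations in mol/L, and hypothetical function names and data. It is not a protocol taken from the cited references.

```python
from statistics import mean


def association_constant(complex_molar: float,
                         free_antigen_molar: float,
                         free_antibody_molar: float) -> float:
    """Ka = [complex] / ([free antigen] * [free antibody]), in L/mole."""
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def scatchard_ka(bound: list[float], free: list[float]) -> float:
    """Estimate Ka as the negative least-squares slope of bound/free vs. bound.

    Assumes one class of binding sites; a polyclonal preparation would
    generally give a curved plot and only an average value.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    ratios = [b / f for b, f in zip(bound, free)]
    x_bar, y_bar = mean(bound), mean(ratios)
    sxx = sum((x - x_bar) ** 2 for x in bound)
    if sxx == 0:
        raise ValueError("bound concentrations must not all be equal")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(bound, ratios))
    return -sxy / sxx


# Synthetic data generated from a hypothetical Ka of 1e8 L/mole and a
# total binding-site concentration of 1e-8 mol/L.
true_ka, site_conc = 1e8, 1e-8
free = [1e-9, 5e-9, 1e-8, 5e-8, 1e-7]
bound = [site_conc * true_ka * f / (1 + true_ka * f) for f in free]
print(f"{scatchard_ka(bound, free):.3e}")
```

On this scale, a preparation with Ka near 10^9 L/mole or higher falls in the range the text prefers for immunoassays, while values near 10^6 to 10^7 L/mole fall in the range preferred for immunopurification.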

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine
the quality and suitability of such preparations for certain downstream applications. For example, a
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for
antibody quality and usage in various applications, are generally available (Catty, supra; Coligan et al.,
supra).

In another embodiment of the invention, polynucleotides encoding TRICH, or any fragment or
complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene
expression can be achieved by designing complementary sequences or antisense molecules (DNA,

express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g.,
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988)
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), hepatitis
B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In
the case where a genetic deficiency in TRICH expression or regulation causes disease, the expression
of TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for
use with cells in vivo or ex vivo include (i) direct DNA microinjection into individual cells, (ii) ballistic
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and
(v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem.
62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J.-L. and H. Recipon (1998) Curr. Opin.
Biotechnol. 9:445-450).

Expression vectors that may be effective for the expression of TRICH include, but are not
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA),
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter
(e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci.
USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau
(1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen));
the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the
FK506/rapamycin inducible promoter, or the RU486/mifepristone inducible promoter (Rossi, F.M.V.
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous
gene encoding TRICH from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver
polynucleotides to target cells in culture and require minimal effort to optimize experimental
parameters. In the alternative, transformation is performed using the calcium phosphate method
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al.
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these
standardized mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects with
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc.
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al.
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et
al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses
a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference.
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells), and the
return of transduced cells to a patient are procedures well-known to persons skilled in the art of gene
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998)
Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290).

In an embodiment, an adenovirus-based gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect
to the expression of TRICH. The construction and packaging of adenovirus-based vectors are well
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"),
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999; Annu.
Rev. Nutr. 19:511-544) and Verma, I.M. and N. Somia (1997; Nature 18:389:239-242).

In another embodiment, a herpes-based gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with
respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV has
a tropism. The construction and packaging of herpes-based vectors are well known to those with
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res.
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S.
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92
which consists of a genome containing at least one exogenous gene to be transferred to a cell under
the control of the appropriate promoter for purposes including human gene therapy. Also taught by
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and
ICP22. For HSV vectors, see also Goins, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, H. et al.
(1994; Dev. Biol. 163:152-161). The manipulation of cloned herpesvirus sequences, the generation of
recombinant virus following the transfection of multiple plasmids containing different segments of the
large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with
herpesvirus are techniques well known to those of ordinary skill in the art.

In another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus,
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA,
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into the
alphavirus genome in place of the capsid-coding region results in the production of a large number of
TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN)
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will
allow the introduction of TRICH into a variety of cell types. The specific transduction of a subset of
cells in a population may require the sorting of cells prior to transduction. The methods of
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 
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Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
inhibits the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and 
Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177). A complementary 
sequence or antisense molecule may also be designed to block translation of mRNA by preventing the 
transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules encoding TRICH. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
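
The scanning step described above can be sketched in a few lines of Python. This is a minimal illustration only: the function name `find_cleavage_windows` and the 17-nucleotide default window (chosen from within the 15 to 20 ribonucleotide range given above) are assumptions made for this example, and the sketch does not evaluate secondary structure or hybridization accessibility.

```python
# Hypothetical helper: locate GUA, GUU, and GUC triplets in a target RNA and
# return a short surrounding window for each candidate cleavage site.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_windows(rna, window=17):
    """Return (position, triplet, window_sequence) for each candidate site.

    `position` is the 0-based index of the G of the triplet. The window is
    centered on the triplet where possible and clipped at the sequence ends.
    """
    rna = rna.upper().replace("T", "U")  # accept cDNA-style input as well
    flank = (window - 3) // 2
    hits = []
    for i in range(len(rna) - 2):
        triplet = rna[i:i + 3]
        if triplet in CLEAVAGE_TRIPLETS:
            start = max(0, i - flank)
            end = min(len(rna), start + window)
            start = max(0, end - window)  # shift left if clipped at the 3' end
            hits.append((i, triplet, rna[start:end]))
    return hits


if __name__ == "__main__":
    for pos, triplet, seq in find_cleavage_windows("AAAAAAAAAAGUCAAAAAAAAAA"):
        print(pos, triplet, seq)
```

Each returned window would still need to be checked for secondary structure and accessibility, as described above, before being considered a usable target.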

Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 
known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
TRICH. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

In other embodiments of the invention, the expression of one or more selected polynucleotides 
of the present invention can be altered, inhibited, decreased, or silenced using RNA interference 
(RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi is a post- 
transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced into a 
targeted cell specifically suppresses the expression of the homologous gene (i.e., the gene bearing the 
sequence complementary to the dsRNA). This effectively knocks out or substantially reduces the 
expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA fragments. 
RNAi methods are described by Fire, A. et al. (1998; Nature 391:806-811) and Gura, T. 
(2000; Nature 404:804-808). PTGS can also be initiated by introduction of a complementary segment 
of DNA into the selected tissue using gene delivery and/or viral vector delivery methods described 
herein or known in the art. 

RNAi can be induced in mammalian cells by the use of small interfering RNA, also known as 
siRNA. siRNAs are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
siRNAs appear to be the mediators of the RNAi effect in mammals. The most effective siRNAs 
appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' overhangs. The use of siRNA for inducing 
RNAi in mammalian cells is described by Elbashir, S.M. et al. (2001; Nature 411:494-498). 

siRNA can either be generated indirectly by introduction of dsRNA into the targeted cell, or 
directly by mammalian transfection methods and agents described herein or known in the art (such as 
liposome-mediated transfection, viral vector methods, or other polynucleotide delivery/introductory 
methods). Suitable siRNAs can be selected by examining a transcript of the target polynucleotide 
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recording the 
occurrence of each nucleotide and the 3' adjacent 19 to 23 nucleotides as potential siRNA target sites, 
with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target 
siRNA sites include the 5' and 3' untranslated regions (UTRs) and regions near the start codon (within 
75 bases), as these may be richer in regulatory protein binding sites. UTR-binding proteins and/or 
translation initiation complexes may interfere with binding of the siRNP endonuclease complex. The 
selected target sites for siRNA can then be compared to the appropriate genome database (e.g., 
human, etc.) using BLAST or other sequence comparison algorithms known in the art. Target 
sequences with significant homology to other coding sequences can be eliminated from consideration. 
The selected siRNAs can be produced by chemical synthesis methods known in the art or by in vitro 
transcription using commercially available methods and kits such as the SILENCER siRNA 
construction kit (Ambion, Austin TX). 
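
The target-site selection rule described above (windows downstream of the AUG start codon, 21 nucleotides preferred, avoiding the UTRs and the first 75 bases after the start codon) might be sketched as follows. The function name and the inputs `cds_start` (0-based index of the A of the AUG) and `cds_end` (index one past the stop codon) are hypothetical and introduced only for this example; measuring the 75-base exclusion from the A of the AUG is likewise an assumption. The homology comparison against a genome database is not performed here.

```python
def candidate_sirna_sites(mrna, cds_start, cds_end, length=21, skip=75):
    """List (start_index, target_sequence) windows within the coding region.

    Windows begin at least `skip` bases downstream of the start codon and end
    at or before `cds_end`, so both UTRs and the region near the start codon
    are excluded.
    """
    if not 19 <= length <= 23:
        raise ValueError("length should be between 19 and 23 nucleotides")
    mrna = mrna.upper().replace("T", "U")
    first = cds_start + skip
    sites = []
    for i in range(first, cds_end - length + 1):
        sites.append((i, mrna[i:i + length]))
    return sites


if __name__ == "__main__":
    # Toy transcript: 10-base 5' UTR, AUG, 120 coding bases, stop codon, 10-base 3' UTR.
    toy = "G" * 10 + "AUG" + "ACGU" * 30 + "UAA" + "C" * 10
    sites = candidate_sirna_sites(toy, cds_start=10, cds_end=136)
    print(len(sites), sites[0])
```

Candidates produced this way would then be screened with BLAST or a comparable tool, as described above, and those with significant homology to other coding sequences discarded.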

In alternative embodiments, long-term gene silencing and/or RNAi effects can be induced in 
selected tissue using expression vectors that continuously express siRNA. This can be accomplished 
using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods 
known in the art (see, e.g., Brummelkamp, T.R. et al. (2002) Science 296:550-553; and Paddison, P.J. 
et al. (2002) Genes Dev. 16:948-958). In these and related embodiments, shRNAs can be delivered to 
target cells using expression vectors known in the art. An example of a suitable expression vector for 
delivery of siRNA is the PSILENCER1.0-U6 (circular) plasmid (Ambion). Once delivered to the 
target tissue, shRNAs are processed in vivo into siRNA-like molecules capable of carrying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods 
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a 
targeted gene can be determined by northern analysis methods using, for example, the 
NORTHERNMAX-GLY kit (Ambion); by microarray methods; by PCR methods; by real time PCR 
methods; and by other RNA/polynucleotide assays known in the art or described herein. Expression 
levels of the protein encoded by the targeted gene can be determined by Western analysis using 
standard techniques known in the art. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased TRICH 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding TRICH may be therapeutically useful, and in the treatment of disorders associated with 
decreased TRICH expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding TRICH may be therapeutically useful. 

In various embodiments, one or more test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed 
by any method commonly known in the art. Typically, the expression of a specific polynucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus forming 
the basis for a comparison of the expression of the polynucleotide both with and without exposure to 
one or more test compounds. Detection of a change in the expression of a polynucleotide exposed to 
a test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomyces pombe gene expression system 
(Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 
28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic 
acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide sequence 
(Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 
6,022,691). 
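
The comparison of hybridization signal with and without exposure to a test compound, described above, can be illustrated with a short sketch. The function names and the two-fold cutoff are assumptions made for this example only; the text does not specify a numerical criterion for what counts as a change in expression.

```python
def expression_fold_change(signal_untreated, signal_treated):
    """Ratio of quantified hybridization signal with vs. without the compound."""
    if signal_untreated <= 0:
        raise ValueError("untreated signal must be positive")
    return signal_treated / signal_untreated


def classify_compound(fold_change, threshold=2.0):
    """Label a test compound using a hypothetical fold-change cutoff."""
    if fold_change >= threshold:
        return "promoter"
    if fold_change <= 1.0 / threshold:
        return "inhibitor"
    return "no effect"


if __name__ == "__main__":
    fc = expression_fold_change(signal_untreated=100.0, signal_treated=25.0)
    print(fc, classify_compound(fc))
```

In practice the cutoff would be chosen from replicate measurements and assay variability rather than fixed in advance.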

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art (Goldman, C.K. et al. (1997) Nat. Biotechnol. 
15:462-466). 



WO 03/083085    PCT/US03/09797 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Mack Publishing, Easton PA). Such compositions may consist of TRICH, 
antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH. 

In various embodiments, the compositions described herein, such as pharmaceutical 
compositions, may be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary, 
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and 
proteins), recent developments in the field of pulmonary delivery via the alveolar region of the lung 
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
et al., U.S. Patent No. 5,997,848). Pulmonary delivery allows administration without needle injection, 
and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined 
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 
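
As a small worked illustration of the LD50/ED50 ratio described above, the following sketch computes a therapeutic index from two dose values expressed in the same units. The function name and the example numbers are hypothetical and are not data from this document.

```python
def therapeutic_index(ld50, ed50):
    """Return the LD50/ED50 ratio; both doses must use the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Illustrative values only: LD50 of 500 mg/kg and ED50 of 5 mg/kg.
    print(therapeutic_index(ld50=500.0, ed50=5.0))
```

A larger ratio indicates a wider margin between the effective and the lethal dose, which is why compositions with large therapeutic indices are preferred.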

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response 
to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 

In another embodiment, antibodies which specifically bind TRICH may be used for the 
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic 
assays for TRICH include methods which utilize the antibody and a label to detect TRICH in human 
body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, 
and may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal 
or standard values for TRICH expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, polynucleotides encoding TRICH may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotides, complementary 
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene 
expression in biopsied tissues in which expression of TRICH may be correlated with disease. The 
diagnostic assay may be used to determine absence, presence, and excess expression of TRICH, and 
to monitor regulation of TRICH levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides, 
including genomic sequences, encoding TRICH or closely related molecules may be used to identify 
nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is made from a 
highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether the probe 
identifies only naturally occurring sequences encoding TRICH, allelic variants, or related sequences. 
Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:60-118 or 
from genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for polynucleotides encoding TRICH 
include the cloning of polynucleotides encoding TRICH or TRICH derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
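
For the probes discussed above, which may share as little as 50% sequence identity with a TRICH-encoding sequence, a simple identity check over two pre-aligned sequences might look like the sketch below. This is a simplification under stated assumptions: the function name is hypothetical, the inputs must already be aligned to equal length with '-' marking gaps, and a gap opposite a base is counted as a mismatch. It is not the percent identity algorithm defined elsewhere in this document.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same base.

    Columns where both sequences have a gap are ignored; a gap opposite a
    base counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    identity = percent_identity("ACGTACGT", "ACGTTCGA")
    print(identity, identity >= 50.0)
```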

Polynucleotides encoding TRICH may be used for the diagnosis of disorders associated with 
expression of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular 
dystrophy, Bell's palsy, Charcot-Marie-Tooth disease, diabetes mellitus, diabetes insipidus, diabetic 
neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic periodic 
paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, 
myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrhythmia, 
tachyarrhythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline myopathy, 
centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, ethanol 
myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis, neurological 
disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, dementia, 
depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other disorders 
associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, 
sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery stenosis, 
sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Graves' disease, goiter, Cushing's 
disease, Addison's disease, glucose-galactose malabsorption syndrome, glycogen storage disease, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
syndrome, von Gierke disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, 
iminoglycinuria, Hartnup disease, Fanconi disease, and Bartter syndrome; a neurological disorder such 
as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including 
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial 
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system including Down syndrome, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy, hemiplegic 
migraine, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, 
progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a 
muscle disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's 
muscular dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear 
myopathy, lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid maltase 
deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia congenita; an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, and trauma; and a cell proliferative disorder such 
as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective 
tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, 
psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal tract, 
heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, and uterus. Polynucleotides encoding TRICH may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, 
pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
detect altered TRICH expression. Such qualitative or quantitative methods are well known in the art. 

In a particular embodiment, polynucleotides encoding TRICH may be used in assays that 
detect the presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding TRICH may be labeled by standard methods and added to a 
fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and 
compared with a standard value. If the amount of signal in the patient sample is significantly altered in 
comparison to a control sample then the presence of altered levels of polynucleotides encoding 
TRICH in the sample indicates the presence of the associated disorder. Such assays may also be 
used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or to monitor the treatment of an individual patient. 
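
The comparison of a patient signal against a standard value can be sketched as follows. The text above does not define "significantly altered"; the rule used here (a patient value more than two sample standard deviations from the mean of normal-subject values) is an assumption chosen purely for illustration, as is the function name.

```python
from statistics import mean, stdev


def is_significantly_altered(patient_signal, normal_signals, n_sd=2.0):
    """Flag a patient signal that lies outside mean +/- n_sd * SD of normals.

    `normal_signals` holds quantified hybridization signals from normal
    subjects and must contain at least two values.
    """
    mu = mean(normal_signals)
    sd = stdev(normal_signals)
    return abs(patient_signal - mu) > n_sd * sd


if __name__ == "__main__":
    normals = [98.0, 102.0, 100.0, 101.0, 99.0]
    print(is_significantly_altered(150.0, normals))
    print(is_significantly_altered(101.0, normals))
```

The same comparison can be repeated on successive samples to follow expression levels during a treatment regimen.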

In order to provide a basis for the diagnosis of a disorder associated with expression of 
TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from normal 
subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TRICH 
may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide 
encoding TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
TRICH, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding TRICH
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of
SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from polynucleotides
encoding TRICH are used to amplify DNA using the polymerase chain reaction (PCR). The DNA
may be derived, for example, from diseased or normal tissue, biopsy samples, bodily fluids, and the
like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR products in
single-stranded form, and these differences are detectable using gel electrophoresis in non-denaturing
gels. In fSSCP, the oligonucleotide primers are fluorescently labeled, which allows detection of the
amplimers in high-throughput equipment such as DNA sequencing machines. Additionally, sequence
database analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by
comparing the sequence of individual overlapping DNA fragments which assemble into a common
consensus sequence. These computer-based methods filter out sequence variations due to laboratory
preparation of DNA and sequencing errors using statistical models and automated analyses of DNA
sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass
spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San
Diego CA).
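
The in silico comparison of overlapping fragments against a common consensus can be illustrated with a minimal sketch. It assumes the fragments have already been aligned to equal length, with "-" marking positions a fragment does not cover, and it substitutes a simple minimum read count (the hypothetical parameter `min_support`) for the statistical models and chromatogram analyses mentioned above. The function name `candidate_snps` is illustrative only.

```python
from collections import Counter

def candidate_snps(aligned_reads, min_support=2):
    """Report columns where a second allele is seen in at least min_support reads.

    aligned_reads: equal-length strings over A/C/G/T, with '-' for no coverage.
    Returns a list of (position, major_base, minor_base) tuples.
    """
    snps = []
    for pos, column in enumerate(zip(*aligned_reads)):
        # Count only called bases; gaps and ambiguous symbols are ignored.
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) < 2:
            continue  # every covering read agrees with the consensus
        (major, _), (minor, minor_n) = counts.most_common(2)
        # A variant seen in a single read is treated as a likely sequencing error.
        if minor_n >= min_support:
            snps.append((pos, major, minor))
    return snps
```

In this sketch a variant supported by only one read is discarded, which is a crude stand-in for the error filtering described in the text.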

SNPs may be used to study the genetic basis of human disease. For example, at least 16
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis,
sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding
lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating
genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations
and their migrations (Taylor, J.G. et al. (2001) Trends Mol. Med. 7:507-512; Kwok, P.-Y. and Z. Gu
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 11:637-641).

Methods which may also be used to quantify the expression of TRICH include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves (Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993)
Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples may be accelerated by
running the assay in a high-throughput format where the oligomer or polynucleotide of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 
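
Interpolating a result from a standard curve can be sketched as follows. The example assumes a set of standards of known amount, each with one measured signal, and it uses straight-line interpolation between the two bracketing standards; the function name and the data layout are illustrative assumptions, and a fitted curve could be used instead.

```python
from bisect import bisect_left

def interpolate_from_standard_curve(standards, signal):
    """Estimate the amount of target that produced `signal`.

    standards: list of (known_amount, measured_signal) pairs.
    Uses linear interpolation between the two bracketing standards.
    """
    # Sort by signal so the bracketing standards can be found by bisection.
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if len(set(signals)) != len(signals):
        raise ValueError("standard signals must be distinct")
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the range of the standards")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return points[i][0]  # exact match with one of the standards
    (a0, s0), (a1, s1) = points[i - 1], points[i]
    return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
```

The function refuses to extrapolate beyond the standards, since a reading outside the calibrated range would normally call for a different dilution.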

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotides described herein may be used as elements on a microarray. The microarray can be
used in transcript imaging techniques which monitor the relative expression levels of large numbers of
genes simultaneously as described below. The microarray may also be used to identify genetic
variants, mutations, and polymorphisms. This information may be used to determine gene function, to
understand the genetic basis of a disorder, to diagnose a disorder, to monitor progression/regression of
disease as a function of gene expression, and to develop and monitor the activities of therapeutic
agents in the treatment of disease. In particular, this information may be used to develop a
pharmacogenomic profile of a patient in order to select the most appropriate and effective treatment
regimen for that patient. For example, therapeutic agents which are highly effective and display the
fewest side effects may be selected for a patient based on his/her pharmacogenomic profile.

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may 
be used as elements on a microarray. The microarray may be used to monitor or measure
protein-protein interactions, drug-target interactions, and gene expression profiles, as described above.

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484;
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present
invention or their complements comprise a subset of a plurality of elements on a microarray. The
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies,
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention
may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000)
Toxicol. Lett. 112-113:467-471). If a test compound has a signature similar to that of a compound
with known toxicity, it is likely to share those toxic properties. These fingerprints or signatures are
most useful and refined when they contain expression information from a large number of genes and
gene families. Ideally, a genome-wide measurement of expression provides the highest quality
signature. Even genes whose expression is not altered by any tested compounds are important,
as the levels of expression of these genes are used to normalize the rest of the expression data.
The normalization procedure is useful for comparison of expression data after treatment with different
compounds. While the assignment of gene function to elements of a toxicant signature aids in
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical
matching of signatures which leads to prediction of toxicity (see, for example, Press Release 00-02
from the National Institute of Environmental Health Sciences, released February 29, 2000, available at
http://www.niehs.nih.gov/oc/news/toxchip.htm). Therefore, it is important and desirable in
toxicological screening using toxicant signatures to include all expressed gene sequences.

In an embodiment, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of the
present invention, so that transcript levels corresponding to the polynucleotides of the present invention
may be quantified. The transcript levels in the treated biological sample are compared with levels in 
an untreated biological sample. Differences in the transcript levels between the two samples are 
indicative of a toxic response caused by the test compound in the treated sample. 
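
The comparison of transcript levels between treated and untreated samples can be sketched as a log2 fold-change calculation. The pseudocount, the threshold of a two-fold change, and the function names below are assumptions made for illustration; the description above does not prescribe a particular statistic.

```python
import math

def log2_fold_changes(treated, untreated, pseudocount=1.0):
    """Return log2(treated / untreated) for every transcript present in both samples.

    treated, untreated: dicts mapping a transcript identifier to a signal level.
    A pseudocount is added to both levels to avoid division by zero.
    """
    return {
        gene: math.log2((treated[gene] + pseudocount) / (untreated[gene] + pseudocount))
        for gene in treated
        if gene in untreated
    }

def responsive_transcripts(treated, untreated, threshold=1.0):
    """List transcripts whose level changed at least 2**threshold fold in either direction."""
    changes = log2_fold_changes(treated, untreated)
    return sorted(gene for gene, lfc in changes.items() if abs(lfc) >= threshold)
```

Transcripts returned by `responsive_transcripts` are those whose levels differ between the two samples by at least the chosen factor.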

Another embodiment relates to the use of the polypeptides disclosed herein to analyze the
proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression
in a particular tissue or cell type. Each protein component of a proteome can be subjected individually
to further analysis. Proteome expression patterns, or profiles, are analyzed by quantifying the number
of expressed proteins and their relative abundance under given conditions and at a given time. A
profile of a cell's proteome may thus be generated by separating and analyzing the polypeptides of a
particular tissue or cell type. In one embodiment, the separation is achieved using two-dimensional gel
electrophoresis, in which proteins from a sample are separated by isoelectric focusing in the first
dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel electrophoresis
in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the gel as
discrete and uniquely positioned spots, typically by staining the gel with an agent such as Coomassie
Blue or silver or fluorescent stains. The optical density of each protein spot is generally proportional to
the level of the protein in the sample. The optical densities of equivalently positioned protein spots
from different samples, for example, from biological samples either treated or untreated with a test
compound or therapeutic agent, are compared to identify any changes in protein spot density related to
the treatment. The proteins in the spots are partially sequenced using, for example, standard methods
employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the protein
in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous
amino acid residues, to the polypeptide sequences of interest. In some cases, further sequence data
may be obtained for definitive protein identification.
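
The comparison of a partial sequence of at least 5 contiguous residues to a polypeptide of interest can be sketched as a sliding-window search. The function name and the choice to accept any shared window of `min_len` residues (rather than requiring the whole partial sequence to match) are assumptions for illustration.

```python
def shares_contiguous_residues(partial, polypeptide, min_len=5):
    """Return True if `partial` and `polypeptide` share a run of at least
    min_len contiguous amino acid residues (one-letter codes)."""
    partial = partial.upper()
    polypeptide = polypeptide.upper()
    if len(partial) < min_len:
        return False  # too short to give a meaningful match
    # Slide a window of min_len residues along the partial sequence.
    return any(
        partial[i:i + min_len] in polypeptide
        for i in range(len(partial) - min_len + 1)
    )
```

A match found this way is only a candidate identification; as noted above, further sequence data may be needed for a definitive assignment.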

A proteomic profile may also be generated using antibodies specific for TRICH to quantify
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a
microarray, and protein expression levels are quantified by exposing the microarray to the sample and
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem.
270:103-111; Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by
a variety of methods known in the art, for example, by reacting the proteins in the sample with a thiol-
or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each
array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be
useful in the analysis of compounds which do not significantly affect the transcript image, but which
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated
biological sample are separated so that the amount of each protein can be quantified. The amount of
each protein is compared to the amount of the corresponding protein in an untreated biological sample. 
A difference in the amount of protein between the two samples is indicative of a toxic response to the 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to the polypeptides of the
present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared 
with the amount in an untreated biological sample. A difference in the amount of protein between the
two samples is indicative of a toxic response to the test compound in the treated sample.

Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan,
T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA
93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/25116; Shalon, D. et al. (1995)
PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155;
Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well known
and thoroughly described in Schena, M., ed. (1999; DNA Microarrays: A Practical Approach, Oxford
University Press, London).

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence.
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may
be preferable over coding sequences. For example, conservation of a coding sequence among
members of a multi-gene family may potentially cause undesired cross hybridization during
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1
constructions, or single chromosome cDNA libraries (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-
355; Price, C.M. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends Genet. 7:149-154). Once
mapped, the nucleic acid sequences may be used to develop genetic linkage maps, for example, which
correlate the inheritance of a disease state with the inheritance of a particular chromosome region or
restriction fragment length polymorphism (RFLP) (Lander, E.S. and D. Botstein (1986) Proc. Natl.
Acad. Sci. USA 83:7353-7357).

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic
map data (Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data
can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM)
World Wide Web site. Correlation between the location of the gene encoding TRICH on a physical
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse,
may reveal associated markers even if the exact chromosomal locus is not known. This information is
valuable to investigators searching for disease genes using positional cloning or other gene discovery
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any
sequences mapping to that area may represent associated or regulatory genes for further investigation
(Gatti, R.A. et al. (1988) Nature 336:577-580). The nucleotide sequence of the instant invention may
also be used to detect differences in the chromosomal location due to translocation, inversion, etc.,
among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (Geysen, et al. (1984) PCT application 
WO84/03564). In this method, large numbers of different small test compounds are synthesized on a
solid substrate. The test compounds are reacted with TRICH, or fragments thereof, and washed.
Bound TRICH is then detected by methods well known in the art. Purified TRICH can also be
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively,
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH.
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with TRICH. 

In additional embodiments, the nucleotide sequences which encode TRICH may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on
properties of nucleotide sequences that are currently known, including, but not limited to, such
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder
of the disclosure in any way whatsoever.

The disclosures of all patents, applications, and publications mentioned above and below,
including U.S. Ser. No. 60/368,840, and U.S. Ser. No. 60/375,637, are hereby expressly incorporated
by reference.



EXAMPLES
I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD 
database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of
denaturants, such as TRIZOL (Invitrogen), a monophasic solution of phenol and guanidine
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively,
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the
POLY(A)PURE mRNA purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the
recommended procedures or similar methods known in the art (Ausubel et al., supra, ch. 5). Reverse
transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were
ligated to double stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme
or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL
S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham
Biosciences) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible
restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid
(Stratagene), PSPORT1 plasmid (Invitrogen, Carlsbad CA), PCDNA2.1 plasmid (Invitrogen), PBK-CMV
plasmid (Stratagene), PCR2-TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid (Stratagene),
pIGEN (Incyte Genomics, Palo Alto CA), pRARE (Incyte Genomics), or pINCY (Incyte Genomics),
or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells including
XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX DH10B
from Invitrogen.

II. Isolation of cDNA Clones

Plasmids obtained as described in Example I were recovered from host cells by in vivo
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid,
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1
ml of distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner
(Labsystems Oy, Helsinki, Finland).

III. Sequencing and Analysis

Incyte cDNAs recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation such
as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides
were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences);
the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard
ABI protocols and base calling software; or other sequence analysis systems known in the art.
Reading frames within the cDNA sequences were identified using standard methods (Ausubel et al.,
supra, ch. 7). Some of the cDNA sequences were selected for extension using the techniques
disclosed in Example VIII.

The polynucleotide sequences derived from Incyte cDNAs were validated by removing
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The
Incyte cDNA sequences or translations thereof were then queried against a selection of public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases and
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo
sapiens, Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae,
Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden
Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft,
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as
SMART (Schultz, J. et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, I. et al. (2002)
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus
primary structures of gene families; see, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol.
6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and
HMMER. The Incyte cDNA sequences were assembled to produce full length polynucleotide
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched
sequences, or Genscan-predicted coding sequences (see Examples IV and V) were used to extend
Incyte cDNA assemblages to full length. Assembly was performed using programs based on Phred,
Phrap, and Consed, and cDNA assemblages were screened for open reading frames using programs
based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated
to derive the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin
at any of the methionine residues of the full length translated polypeptide. Full length polypeptide
sequences were subsequently analyzed by querying against databases such as the GenBank protein
databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO,
PRODOM, Prosite, hidden Markov model (HMM)-based protein family databases such as PFAM,
INCY, and TIGRFAM; and HMM-based protein domain databases such as SMART. Full length
polynucleotide sequences are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda
CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments
are generated using default parameters specified by the CLUSTAL algorithm as incorporated into the
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent
identity between aligned sequences.
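
Percent identity between two aligned sequences can be sketched as below. Definitions of percent identity differ in how gapped columns are counted; this sketch assumes that columns containing a gap in either sequence are excluded from the denominator, which may not be the convention used by the MEGALIGN program. The function name is illustrative.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of compared alignment columns in which the two residues match.

    Both inputs must be rows of the same alignment (equal length).
    Columns with a gap in either row are skipped (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # gapped column: not counted
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0
```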









Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of
which are incorporated by reference herein in their entirety, and the fourth column presents, where
applicable, the scores, probability values, and other parameters used to evaluate the strength of a 
match between two sequences (the higher the score or the lower the probability value, the greater the 
identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID
NO:60-118. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies are described in Table 4, column 2.

IV. Identification and Editing of Coding Sequences from Genomic DNA

Putative transporters and ion channels were initially identified by running the Genscan gene
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is
a general-purpose gene identification program which analyzes genomic DNA sequences from a
variety of organisms (Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94; Burge, C. and S. Karlin
(1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to form an
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan is a
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA
sequences encode transporters and ion channels, the encoded polypeptides were analyzed by querying
against PFAM models for transporters and ion channels. Potential transporters and ion channels were
also identified by homology to Incyte cDNA sequences that had been annotated as transporters and
ion channels. These selected Genscan-predicted sequences were then compared by BLAST analysis
to the genpept and gbpri public databases. Where necessary, the Genscan-predicted sequences were
then edited by comparison to the top BLAST hit from genpept to correct errors in the sequence
predicted by Genscan, such as extra or omitted exons. BLAST analysis was also used to find any
Incyte cDNA or public cDNA coverage of the Genscan-predicted sequences, thus providing evidence
for transcription. When Incyte cDNA coverage was available, this information was used to correct or
confirm the Genscan predicted sequence. Full length polynucleotide sequences were obtained by
assembling Genscan-predicted coding sequences with Incyte cDNA sequences and/or public cDNA
sequences using the assembly process described in Example III. Alternatively, full length
polynucleotide sequences were derived entirely from edited or unedited Genscan-predicted coding
sequences.
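
The concatenation of predicted exons into an assembled cDNA extending from a methionine to a stop codon can be sketched as follows. This is not the Genscan program; it assumes exon coordinates are already available as 0-based, half-open intervals on the forward strand, and the function names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def assemble_cdna(genomic, exons):
    """Join exon intervals, in genomic order, into a single cDNA string.

    exons: iterable of (start, end) pairs, 0-based and half-open.
    """
    return "".join(genomic[start:end] for start, end in sorted(exons)).upper()

def is_complete_coding_sequence(cdna):
    """Check that the sequence runs in frame from ATG to a single terminal stop codon."""
    if len(cdna) < 6 or len(cdna) % 3 != 0:
        return False
    codons = [cdna[i:i + 3] for i in range(0, len(cdna), 3)]
    starts_with_met = codons[0] == "ATG"
    ends_with_stop = codons[-1] in STOP_CODONS
    # An in-frame stop before the last codon would truncate the polypeptide.
    no_internal_stop = not any(c in STOP_CODONS for c in codons[1:-1])
    return starts_with_met and ends_with_stop and no_internal_stop
```

A check of this kind is one simple way to flag predictions with extra or omitted exons, which would typically shift the reading frame or introduce an internal stop.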

V. Assembly of Genomic Sequence Data with cDNA Sequence Data
"Stitched" Sequences

Partial cDNA sequences were extended with exons predicted by the Genscan gene
identification program described in Example IV. Partial cDNAs assembled as described in Example
III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm
based on graph theory and dynamic programming to integrate cDNA and genomic information,
generating possible splice variants that were subsequently confirmed, edited, or extended to create a
full length sequence. Sequence intervals in which the entire length of the interval was present on
more than one sequence in the cluster were identified, and intervals thus identified were considered to
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic
sequences, then all three intervals were considered to be equivalent. This process allows unrelated
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals
thus identified were then "stitched" together by the stitching algorithm in the order that they appear
along their parent sequences to generate the longest possible sequence, as well as sequence variants.
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or
genomic sequence to genomic sequence) were given preference over linkages which change parent
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended
with additional cDNA sequences, or by inspection of genomic DNA, when necessary.
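
The rule that intervals are "equivalent by transitivity" can be illustrated with a short sketch. The Python example below is an illustration only, not the stitching algorithm itself: the interval labels are hypothetical, the pairwise equivalences are assumed to have been determined already, and a union-find structure is one possible way (an assumption of this sketch) to group them.

```python
def group_equivalent_intervals(intervals, equivalent_pairs):
    """Group intervals into equivalence classes by transitivity (union-find)."""
    parent = {name: name for name in intervals}

    def find(name):
        # Follow parent links to the class representative, shortening the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in equivalent_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two classes

    groups = {}
    for name in intervals:
        groups.setdefault(find(name), set()).add(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical labels: one interval seen on a cDNA and on two genomic sequences,
# plus an unrelated interval that stays in its own class.
intervals = ["cDNA_1:a", "genomic_1:a", "genomic_2:a", "genomic_2:b"]
pairs = [("cDNA_1:a", "genomic_1:a"), ("cDNA_1:a", "genomic_2:a")]
classes = group_equivalent_intervals(intervals, pairs)
```

Here the two genomic intervals are never compared directly, yet they end up in the same class because each is equivalent to the cDNA interval.
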
"Stretched" Sequences

Partial DNA sequences were extended to full length with an algorithm based on BLAST
analysis. First, partial cDNAs assembled as described in Example III were queried against public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions
may occur in the chimeric protein with respect to the original GenBank protein homolog. The
GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous
genomic sequences from the public human genome databases. Partial DNA sequences were
therefore "stretched" or extended by the addition of homologous genomic sequences. The resultant
stretched sequences were examined to determine whether they contained a complete gene.

VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:60-118 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:60-118 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.

Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in
humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances
are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid
markers whose sequences were included in each of the clusters. Human genome maps and other
resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease
genes map within or in proximity to the intervals indicated above.

VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et
al., supra, ch. 4).

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much
[...] basis of the search is the product score, which is defined as:

$$\text{Product score} = \frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}$$

[...] expression of cDNA encoding TRICH. cDNA sequences and cDNA library/tissue information are
found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA).
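
As a worked illustration of the product score defined earlier in this Example (BLAST score multiplied by percent identity, divided by five times the length of the shorter sequence), the following Python sketch evaluates the formula directly. The function name and the input values are hypothetical and are chosen only to show the arithmetic.

```python
def product_score(blast_score, percent_identity, len_seq1, len_seq2):
    """Product score = (BLAST score x percent identity) / (5 x shorter length)."""
    shorter = min(len_seq1, len_seq2)
    if shorter <= 0:
        raise ValueError("sequence lengths must be positive")
    return (blast_score * percent_identity) / (5 * shorter)


# Hypothetical inputs: BLAST score 500, 100% identity, sequences of 100 and 120 residues.
example = product_score(blast_score=500, percent_identity=100, len_seq1=100, len_seq2=120)
```

With these inputs the numerator is 50,000 and the denominator is 500, so the score is 100.
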
VIII. Extension of TRICH Encoding Polynucleotides

Full length polynucleotides are produced by extension of an appropriate fragment of the full
length molecule using oligonucleotide primers designed from this fragment. One primer was
synthesized to initiate 5' extension of the known fragment, and the other primer was synthesized to
initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 4.06
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in
length, to have a GC content of about 50% or more, and to anneal to the target sequence at
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided.
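
Two of the primer design criteria above (length of about 22 to 30 nucleotides and GC content of about 50% or more) lend themselves to a simple check. The Python sketch below is a hypothetical helper, not part of the OLIGO software; the primer sequences are invented for illustration, and the annealing temperature and hairpin/dimer criteria are deliberately not modeled because the text gives no formula for them.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in a primer sequence."""
    if not primer:
        raise ValueError("primer must not be empty")
    primer = primer.upper()
    return sum(base in "GC" for base in primer) / len(primer)


def meets_length_and_gc(primer, min_len=22, max_len=30, min_gc=0.50):
    """Check only the length and GC-content criteria stated in the text."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Invented 24-mer with 15 G/C bases (GC fraction 0.625): passes both checks.
candidate_ok = meets_length_and_gc("GCGATCGGCTAGCTAGGCCATCGA")
# Invented 24-mer with no G/C bases: fails the GC criterion.
candidate_low_gc = meets_length_and_gc("AT" * 12)
```

Because the text says "about", the thresholds are exposed as parameters rather than fixed.
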

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme
(Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels,
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Biosciences), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and
transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing
media, and individual colonies were picked and cultured overnight at 37°C in 384-well plates in LB/2x
carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step
1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and
4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries
were reamplified using the same conditions as described above. Samples were diluted with 20%
dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE
Terminator cycle sequencing ready reaction kit (Applied Biosystems).

In like manner, full length polynucleotides are verified using the above procedure or are used
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for
such extension, and an appropriate genomic library.

IX. Identification of Single Nucleotide Polymorphisms in TRICH Encoding 
Polynucleotides 

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were
identified in SEQ ID NO:60-118 using the LIFESEQ database (Incyte Genomics). Sequences from
the same gene were clustered together and assembled as described in Example III, allowing the
identification of all sequence variants in the gene. An algorithm consisting of a series of filters was
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of
basecall errors by requiring a minimum Phred quality score of 15, and removed sequence alignment
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice variants.
An automated procedure of advanced chromosome analysis analysed the original chromatogram files
in the vicinity of the putative SNP. Clone error filters used statistically generated algorithms to identify
errors introduced during laboratory processing, such as those caused by reverse transcriptase,
polymerase, or somatic mutation. Clustering error filters used statistically generated algorithms to
identify errors resulting from clustering of close homologs or pseudogenes, or due to contamination by
non-human sequences. A final set of filters removed duplicates and SNPs found in immunoglobulins
or T-cell receptors.
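
The idea of a "series of filters" can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the algorithm used to generate the data: only the Phred threshold of 15, the duplicate filter, and the immunoglobulin/T-cell receptor filter come from the text, while the record fields (`cluster`, `position`, `allele`, `phred`, `gene_class`) and the example records are hypothetical. The clone error and clustering error filters are not modeled.

```python
MIN_PHRED = 15  # minimum Phred quality score stated in the text
EXCLUDED_GENE_CLASSES = {"immunoglobulin", "T-cell receptor"}


def filter_snp_candidates(candidates):
    """Pass candidate variants through a short series of filters, in order."""
    seen = set()
    kept = []
    for cand in candidates:
        if cand["phred"] < MIN_PHRED:
            continue  # preliminary filter: probable basecall error
        if cand["gene_class"] in EXCLUDED_GENE_CLASSES:
            continue  # final filter: immunoglobulins and T-cell receptors
        key = (cand["cluster"], cand["position"], cand["allele"])
        if key in seen:
            continue  # final filter: duplicates
        seen.add(key)
        kept.append(cand)
    return kept


# Hypothetical candidate records, for illustration only.
candidates = [
    {"cluster": "c1", "position": 101, "allele": "A", "phred": 30, "gene_class": "transporter"},
    {"cluster": "c1", "position": 101, "allele": "A", "phred": 25, "gene_class": "transporter"},
    {"cluster": "c1", "position": 250, "allele": "G", "phred": 10, "gene_class": "transporter"},
    {"cluster": "c2", "position": 77, "allele": "T", "phred": 40, "gene_class": "immunoglobulin"},
]
kept = filter_snp_candidates(candidates)
```

Of the four hypothetical candidates, only the first survives: the second is a duplicate, the third falls below the Phred threshold, and the fourth lies in an immunoglobulin.
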

Certain SNPs were selected for further characterization by mass spectrometry using the high
throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46
female), including 83 from Utah, four French, three Venezuelan, and two Amish individuals. The
African population comprised 194 individuals (97 male, 97 female), all African Americans. The
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed
no allelic variance in this population were not further tested in the other three populations.
X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:60-118 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN,
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25
superfine size exclusion dextran bead column (Amersham Biosciences). An aliquot containing 10^7
counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I,
Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

XI. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et al., supra),
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the
aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed.
(1999) DNA Microarrays: A Practical Approach, Oxford University Press, London). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be
produced using available methods and machines well known to those of ordinary skill in the art and
may contain any appropriate number of elements (Schena, M. et al. (1995) Science 270:467-470;
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol.
16:27-31).

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection.
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on
the microarray may be assessed. In one embodiment, microarray preparation and usage is described
in detail below.

Tissue or Cell Sample Preparation

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, [...]
strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM
dCTP, 40 µM dCTP-Cy3 (BBS) or dCTP-Cy5 (Amersham Biosciences). The reverse transcription
reaction is performed in a 25 µl volume containing 200 ng poly(A)+ RNA with GEMBRIGHT kits
(Incyte Genomics). Specific control poly(A)+ RNAs are synthesized by in vitro transcription from
non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one with
Cy3 and another with Cy5 labeling) is treated with 2.5 µl of 0.5M sodium hydroxide and incubated for
20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified using two
successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo Alto CA) and after
combining, both reaction samples are ethanol precipitated using 1 µl of glycogen (1 mg/ml), 60 µl
sodium acetate, and 300 µl of 100% ethanol. The sample is then dried to completion using a
SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 µl 5X SSC/0.2% SDS.

Microarray Preparation

Sequences of the present invention are used to generate array elements. Each array element
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses
primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 µg.
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and
coated with 0.05% aminopropyl silane (Sigma-Aldrich, St. Louis MO) in 95% ethanol. Coated slides
are cured in a 110°C oven.

Array elements are applied to the coated glass substrate using a procedure described in U.S.
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 0.2%
SDS and distilled water as before.
Hybridization

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with
a 1.8 cm² coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140
µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for about
6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (1X SSC, 0.1%
SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried.
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photo-multiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from 
different sources (e.g., representing test and control cells), each labeled with a different fluorophore, 
are hybridized to a single array for the purpose of identifying genes that are differentially expressed, 
the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and 
adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RH-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission
spectra) between the fluorophores using each fluorophore's emission spectrum.
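
The linear 20-color mapping of 12-bit digitized intensities can be illustrated as follows. This Python sketch shows one plausible reading of "linear 20-color transformation" and is not the display software's actual implementation: the function name is hypothetical, and equal-width binning of the 0 to 4095 range is an assumption.

```python
def pseudocolor_index(adc_value, bits=12, n_colors=20):
    """Map a digitized intensity to a color bin: 0 = blue (low) ... n_colors - 1 = red (high)."""
    max_count = 1 << bits  # 4096 distinct values for a 12-bit converter
    if not 0 <= adc_value < max_count:
        raise ValueError("value outside the converter's range")
    # Equal-width linear binning (an assumption of this sketch).
    return adc_value * n_colors // max_count


lowest = pseudocolor_index(0)      # bluest bin
middle = pseudocolor_index(2048)   # mid-scale bin
highest = pseudocolor_index(4095)  # reddest bin
```
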

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated
to obtain a numerical value corresponding to the average intensity of the signal. The software used
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte Genomics). Array 
elements that exhibit at least about a two-fold change in expression, a signal-to-background ratio of at 
least about 2.5, and an element spot size of at least about 40%, are considered to be differentially 
expressed. 
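
The three criteria for calling an array element differentially expressed can be expressed as a simple predicate. The Python sketch below is illustrative only and is not the GEMTOOLS program: the function and parameter names are hypothetical, the expression ratio is assumed to be test signal divided by control signal, and spot size is assumed to be given as a fraction (0.40 for 40%). Because the text says "at least about", the cutoffs are exposed as parameters.

```python
def is_differentially_expressed(ratio, signal_to_background, spot_size_fraction,
                                min_fold=2.0, min_sb=2.5, min_spot=0.40):
    """Apply the fold-change, signal-to-background, and spot-size criteria together."""
    if ratio <= 0:
        raise ValueError("expression ratio must be positive")
    # A two-fold change may be an increase (ratio >= 2) or a decrease (ratio <= 0.5).
    fold_change = max(ratio, 1.0 / ratio)
    return (fold_change >= min_fold
            and signal_to_background >= min_sb
            and spot_size_fraction >= min_spot)


# Hypothetical measurements for four array elements.
up = is_differentially_expressed(2.0, 3.0, 0.5)         # two-fold increase
down = is_differentially_expressed(0.5, 2.5, 0.4)       # two-fold decrease
small = is_differentially_expressed(1.5, 3.0, 0.5)      # change too small
weak = is_differentially_expressed(4.0, 2.0, 0.5)       # signal too close to background
```
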

Expression
Breast cancer 

For example, SEQ ID NO:85 showed decreased expression in nonmalignant breast
adenocarcinoma cells treated with serum tumor necrosis factor alpha (TNF-α) versus untreated
nonmalignant breast adenocarcinoma cells as determined by microarray analysis. MCF7 is a
nonmalignant breast adenocarcinoma cell line isolated from the pleural effusion of a 69-year-old
female. MCF7 has retained characteristics of the mammary epithelium such as the ability to process
estradiol via cytoplasmic estrogen receptors and the capacity to form domes in culture. MCF7 cells
were treated with TNF-α for 1, 4, 8, 12, 24, 36, 48, and 72 hours. Treated cells were compared to
untreated cells kept in culture for the same amount of time. The expression of SEQ ID NO:85 was
reduced by at least two-fold at later time points. In addition, SEQ ID NO:85 showed decreased
expression in breast carcinoma cells treated with interferon gamma (IFNγ) versus untreated breast
carcinoma cells. T-47D is a breast carcinoma cell line isolated from a pleural effusion obtained from
a 54-year-old female with an infiltrating ductal carcinoma of the breast. T-47D cells were treated
with 200 ng/ml IFNγ for 1, 4, 8, 24, 48 hours and 3 days. These treated cells were compared to
untreated cells. The expression of SEQ ID NO:85 was reduced by at least two-fold at later time
points.

In a further example, SEQ ID NO:88 showed differential expression in breast cell carcinoma
cells versus nonmalignant mammary epithelial cells as determined by microarray analysis. Gene
expression profiles of nonmalignant mammary epithelial cells were compared to the gene expression
profile of a breast carcinoma line. The cells were grown in defined serum-free H14 medium to
70-80% confluence prior to RNA harvest. Cell lines compared include T-47D, a breast carcinoma cell
line isolated from a pleural effusion obtained from a 54-year-old female with an infiltrating ductal
carcinoma of the breast, versus MCF-10A, a breast mammary gland cell line isolated from a
36-year-old woman with fibrocystic breast disease, and HMEC, a primary breast epithelial cell line isolated
from a normal donor. The expression of SEQ ID NO:88 was increased by at least two-fold in T-47D
cells as compared to either HMEC or MCF-10A cells.

In a further example, SEQ ID NO:112 showed differential expression in breast tumor tissue
as compared to normal breast tissue from the same donor as determined by microarray analysis.
Tumor from the right breast was compared to grossly uninvolved breast tissue from the same donor, a
43-year-old female diagnosed with invasive lobular carcinoma in situ. The expression of SEQ ID
NO:112 was decreased by at least two-fold in the tumor tissue as compared to the matched non-tumor
tissue.

In a further example, SEQ ID NO:113 showed differential expression in breast cancer cell
lines as compared to non-cancerous breast epithelial cell lines as determined by microarray analysis.
Cell lines compared included: a) BT-20, a breast carcinoma cell line derived in vitro from the cells
emigrating out of thin slices of tumor mass isolated from a 74-year-old female, b) BT-474, a breast
ductal carcinoma cell line that was isolated from a solid, invasive ductal carcinoma of the breast
obtained from a 60-year-old woman, c) BT-483, a breast ductal carcinoma cell line that was isolated
from a papillary invasive ductal tumor obtained from a 23-year-old normal, menstruating, parous
female with a family history of breast cancer, d) Hs 578T, a breast ductal carcinoma cell line isolated
from a 74-year-old female with breast carcinoma, e) MCF7, a nonmalignant breast adenocarcinoma
cell line isolated from the pleural effusion of a 69-year-old female, f) MCF-10A, a breast mammary
gland (luminal ductal characteristics) cell line isolated from a 36-year-old woman with fibrocystic
breast disease, g) MDA-MB-468, a breast adenocarcinoma cell line isolated from the pleural effusion
of a 51-year-old female with metastatic adenocarcinoma of the breast, and h) HMEC, a primary
breast epithelial cell line isolated from a normal donor. Expression of SEQ ID NO:113 was
decreased by at least two-fold in the BT-474 and BT-483 breast cancer cell lines as compared to the
non-malignant HMEC cells. Therefore, SEQ ID NO:113 is useful in monitoring treatment of, and
diagnostic assays for, breast cancer.

Therefore, in various embodiments, SEQ ID NO:85, SEQ ID NO:88, and SEQ ID NO:112-113
can each be used for one or more of the following: i) monitoring treatment of breast
adenocarcinoma and other cell proliferative disorders, ii) diagnostic assays for breast adenocarcinoma
and other cell proliferative disorders, and iii) developing therapeutics and/or other treatments for breast
adenocarcinoma and other cell proliferative disorders.
Lung cancer 

In another example, SEQ ID NO:85 showed increased expression in lung tumor tissue versus
normal lung tissue. Normal lung tissue from a 68-year-old female was compared to lung tumor from
the same donor (Roy Castle International Centre for Lung Cancer Research, Liverpool, UK).

In a further example, SEQ ID NO:92, SEQ ID NO:93, and SEQ ID NO:94 showed
differential expression in lung tumor tissues compared to normal lung tissue from the same donor as
determined by microarray analysis. Samples of normal lung were compared to lung tumor from the
same donor (Roy Castle International Centre for Lung Cancer Research, Liverpool, UK). The
expression of SEQ ID NO:92, SEQ ID NO:93, and SEQ ID NO:94 was decreased by at least
two-fold in tumor tissue as compared to the matched normal lung for seven different donors in the case of
SEQ ID NO:92, and for one donor in the case of SEQ ID NO:93 and SEQ ID NO:94.

Therefore, in various embodiments, SEQ ID NO:85, SEQ ID NO:92, SEQ ID NO:93, and
SEQ ID NO:94 can each be used for one or more of the following: i) monitoring treatment of lung
cancer and other cell proliferative disorders, ii) diagnostic assays for lung cancer and other cell
proliferative disorders, and iii) developing therapeutics and/or other treatments for lung cancer and
other cell proliferative disorders.
Colon cancer 

In a further example, SEQ ID NO:85 showed decreased expression in sigmoid colon tumor
tissue versus normal sigmoid colon tissue. Gene expression profiles were obtained by comparing
normal sigmoid colon tissue from a 48-year-old female to a sigmoid colon tumor originating from a
metastatic gastric sarcoma (stromal tumor) from the same donor (Huntsman Cancer Institute, Salt
Lake City, UT).

In a further example, SEQ ID NO:91, SEQ ID NO:93, and SEQ ID NO:94 showed
differential expression in colon tumor tissues compared to normal colon tissue from the same donor as
determined by microarray analysis. Samples of normal colon were compared to colon tumor from the
same donor (Huntsman Cancer Institute, Salt Lake City, UT). The expression of SEQ ID NO:91
was decreased, and that of SEQ ID NO:93 and SEQ ID NO:94 increased, by at least two-fold in
tumor tissue as compared to matched normal colon tissue.

Therefore, in various embodiments, SEQ ID NO:85, SEQ ID NO:91, SEQ ID NO:93, and
SEQ ID NO:94 can each be used for one or more of the following: i) monitoring treatment of colon
cancer and other cell proliferative disorders, ii) diagnostic assays for colon cancer and other cell
proliferative disorders, and iii) developing therapeutics and/or other treatments for colon cancer and
other cell proliferative disorders.
Ovarian cancer 

In another example, SEQ ID NO:88 showed differential expression associated with ovarian
cancer, as determined by microarray analysis. A normal ovary from a 79-year-old female donor was
compared to an ovarian tumor from the same donor (Huntsman Cancer Institute, Salt Lake City, UT).
SEQ ID NO:88 expression was increased at least two-fold in the tumor tissue as compared to the
normal tissue.

In a further example, SEQ ID NO:92, SEQ ID NO:109, and SEQ ID NO:112 showed
differential expression in ovary tumor versus normal ovary tissue as determined by microarray
analysis. A normal ovary from a 79-year-old female donor was compared to an ovarian tumor from
the same donor (Huntsman Cancer Institute, Salt Lake City, UT). The expression of SEQ ID NO:92
and SEQ ID NO:109 was increased, and the expression of SEQ ID NO:112 decreased, by at least
two-fold in the ovarian tumor tissue as compared to the matched normal tissue.

Therefore, in various embodiments, SEQ ID NO:88, SEQ ID NO:92, SEQ ID NO:109, and
SEQ ID NO:112 can each be used for one or more of the following: i) monitoring treatment of ovarian
cancer and other cell proliferative disorders, ii) diagnostic assays for ovarian cancer and other cell
proliferative disorders, and iii) developing therapeutics and/or other treatments for ovarian cancer and
other cell proliferative disorders.
Osteosarcoma

In a further example, SEQ ID NO:103, SEQ ID NO:109, and SEQ ID NO:118 showed
differential expression in osteosarcoma-associated tissues as compared to normal osteoblasts as
determined by microarray analysis. Messenger RNA from normal human osteoblasts was compared
with mRNA from biopsy specimens, osteosarcoma tissues, or primary cultures or metastasized
tissues. A normal osteoblast primary culture, NHOst 5488, was chosen as the reference in the initial
experiments. One basic set of experiments is defined as the comparison of mRNA from a biopsy
specimen with that of the definitive surgical specimen from the same patient. Extended study of this basic
set includes mRNA from primary cell cultures of the definitive surgical specimen, muscle, or cartilage
tissue from the same patient. Biopsy specimens, definitive surgical specimens, or lung metastatic
tissues from different individuals were also included to reveal individual variability. Expression of SEQ
ID NO:103 was increased, and expression of SEQ ID NO:109 and SEQ ID NO:118 was decreased, by at
least two-fold in osteosarcoma-associated tissues as compared to the normal osteoblasts.

Therefore, in various embodiments, SEQ ID NO:103, SEQ ID NO:109, and SEQ ID NO:118
can be used for one or more of the following: i) monitoring treatment of osteosarcoma and other cell
proliferative disorders, ii) diagnostic assays for osteosarcoma and other cell proliferative disorders, and
iii) developing therapeutics and/or other treatments for osteosarcoma and other cell proliferative
disorders.

Autoimmune and inflammatory disorders 

In a further example, PBMCs from 3 healthy volunteer donors were stimulated in vitro with
TNF-α for 2 hours. Treated cells were compared to untreated cells from the same donors. In a
separate experiment, PBMCs from 5 healthy volunteers were incubated in the presence of
pro-inflammatory cytokines (IL-1β, IL-2, IL-6, IL-8, IL-12, IL-18, IFN-γ, and TNF-α) for 2 and 4 hours.
Cytokine-treated PBMCs were compared to untreated PBMCs from the same donors. In both cases,
the expression of SEQ ID NO:93 and SEQ ID NO:94 was increased at least two-fold in the treated
cells as compared to the untreated cells. Therefore, SEQ ID NO:93 and SEQ ID NO:94 are useful in
monitoring treatment of, and diagnostic assays for, autoimmune and inflammatory disorders.
XII. Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. To
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence
and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary
oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding transcript.
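
As a rough illustration of what "complementary" means here, the sketch below enumerates candidate antisense oligonucleotides of a chosen length within the 15 to 30 nucleotide range mentioned above. It is not the OLIGO 4.06 algorithm and applies no uniqueness, melting-temperature, or secondary-structure criteria; the function names and the placeholder sequence are hypothetical and do not correspond to any SEQ ID NO.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G, and T")
    return seq.translate(COMPLEMENT)[::-1]


def antisense_candidates(coding_seq, length=20):
    """List (start, oligo) pairs, each oligo complementary to one window."""
    if not 15 <= length <= 30:
        raise ValueError("length is expected to be between 15 and 30")
    last_start = len(coding_seq) - length
    return [
        (start, reverse_complement(coding_seq[start:start + length]))
        for start in range(last_start + 1)
    ]


# Placeholder 28-nucleotide sequence, for illustration only.
placeholder_seq = "ATGGCCAAGCTTGGATCCGTACGATCGA"
candidates = antisense_candidates(placeholder_seq, length=20)
```

Each returned oligonucleotide would then need to be screened by design software or by experiment before use.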

XIII. Expression of TRICH

Expression and purification of TRICH is achieved using bacterial or virus-based expression
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express TRICH upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes in some cases.
Infection of the latter requires additional genetic modifications to baculovirus (Engelhard, E.K. et al.
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945).

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Biosciences).
Following purification, the GST moiety can be proteolytically cleaved from TRICH at specifically
engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN).
Methods for protein expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16).
Purified TRICH obtained by these methods can be used directly in the assays shown in Examples
XVII, XVIII, and XIX, where applicable.
XIV. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include PCMV SPORT plasmid (Invitrogen, Carlsbad CA) and PCR3.1 plasmid (Invitrogen), both of
which contain the cytomegalovirus promoter. 5-10 μg of recombinant vector are transiently
transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either
liposome formulations or electroporation. 1-2 μg of an additional plasmid containing sequences
encoding a marker protein are co-transfected. Expression of a marker protein provides a means to
distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein
(GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These
events include changes in nuclear DNA content as measured by staining of DNA with propidium
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side
light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity
with specific antibodies; and alterations in plasma membrane composition as measured by the binding
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are
discussed in Ormerod, M.G. (1994; Flow Cytometry, Oxford, New York NY).

The influence of TRICH on gene expression can be assessed using highly purified populations
of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. CD64 and
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success
NY). mRNA can be purified from the cells using methods well known by those of skill in the art.
Expression of mRNA encoding TRICH and other genes of interest can be analyzed by northern
analysis or microarray techniques.
XV. Production of TRICH Specific Antibodies

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols.

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art (Ausubel et al., supra, ch. 11).

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH
(Sigma-Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to
increase immunogenicity (Ausubel et al., supra). Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-TRICH
activity by, for example, binding the peptide or TRICH to a substrate, blocking with 1% BSA, reacting
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XVI. Purification of Naturally Occurring TRICH Using Specific Antibodies 

Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity
chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and
washed according to the manufacturer's instructions.

Media containing TRICH are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and TRICH is collected.

XVII. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or
antagonists, modulatory proteins such as Gβγ proteins (Reimann, supra) or proteins involved in
TRICH localization or clustering such as MAGUKs (Craven, supra). TRICH, or biologically active
fragments thereof, are labeled with 125I Bolton-Hunter reagent. (See, e.g., Bolton, A.E. and W.M.
Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a
multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate
values for the number, affinity, and association of TRICH with the candidate molecules.
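
The text does not state how binding parameters are calculated from the concentration series. One conventional approach, shown here only as an assumption, is Scatchard analysis, in which bound/free is plotted against bound so that the slope equals -1/Kd and the x-intercept equals the number of sites (Bmax). The function name, variable names, and data below are hypothetical; the data are synthetic values generated from Kd = 2 and Bmax = 10 in arbitrary concentration units.

```python
def scatchard_fit(free, bound):
    """Least-squares line through (bound, bound/free); returns (Kd, Bmax)."""
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals -1/Kd for one class of sites
    intercept = mean_y - slope * mean_x  # equals Bmax/Kd
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


# Synthetic single-site data: bound = Bmax * free / (Kd + free).
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd_est, bmax_est = scatchard_fit(free_conc, bound_conc)
```

With real counts, nonspecific binding would be subtracted first, and nonlinear regression of the binding isotherm is often preferred over the linearized plot.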

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are
commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins
are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122).

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT)
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S.
Patent No. 6,057,101).

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH
ion channel activity using the assays described in section XVIII.

XVIII. Demonstration of TRICH Activity

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are
commercially available, and the techniques to introduce them into cells are well known to those skilled
in the art. A second plasmid which expresses any one of a number of marker genes, such as
β-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after
transformation under conditions appropriate for the cell line to allow expression and accumulation of
TRICH and β-galactosidase.

Transformed cells expressing β-galactosidase are stained blue when a suitable colorimetric
substrate is added to the culture media under conditions that are well known in the art. Stained cells
are tested for differences in membrane conductance by electrophysiological techniques that are well
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone or
β-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to
conductance can be confirmed by incubating the cells with antibodies specific for TRICH. The
antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel
and the associated conductance.

Alternatively, ion channel activity of TRICH is measured as current flow across a
TRICH-containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et
al., supra; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KCl,
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations of
the TRICH mediator, such as cAMP, cGMP, or Ca2+ (in the form of CaCl2), where appropriate.
Electrode resistance is set at 2-5 MΩ and electrodes are filled with the intracellular solution lacking
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. Voltage
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current
measured is proportional to the activity of TRICH in the assay.
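
The ramp parameters stated above fix the acquisition arithmetic, which the following minimal sketch makes explicit. The function name and the convention that the first sample is taken at the start of the ramp are assumptions; the sketch only generates the command voltage and does not model current or any particular amplifier.

```python
def voltage_ramp(v_start_mv=-100.0, v_end_mv=100.0,
                 duration_s=2.5, sample_rate_hz=500.0):
    """Return sample times, command voltages, and ramp rate in mV/s."""
    n_samples = int(round(duration_s * sample_rate_hz))
    rate_mv_per_s = (v_end_mv - v_start_mv) / duration_s
    # Sample i is taken at time i / sample_rate_hz after the ramp starts.
    times = [i / sample_rate_hz for i in range(n_samples)]
    volts = [v_start_mv + rate_mv_per_s * t for t in times]
    return times, volts, rate_mv_per_s


times_s, command_mv, ramp_rate = voltage_ramp()
```

Under these assumptions each ramp yields 1250 samples at a rate of 80 mV/s, with successive samples 0.16 mV apart.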

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5 mM NaCl, 2.5 mM KCl, 1 mM CaCl2,
1 mM MgCl2, 1 mM Na2HPO4, 5 mM Hepes, 3.8 mM NaOH, 50 μg/ml gentamycin, pH 7.8) to allow
expression of TRICH. Oocytes are then transferred to standard uptake medium (100 mM NaCl, 2
mM KCl, 1 mM CaCl2, 1 mM MgCl2, 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g.,
amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g.,
radiolabeled with 3H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for
30 minutes, uptake is terminated by washing the oocytes three times in Na+-free medium. The
incorporated label is then measured and compared with controls. TRICH activity is proportional to the level of
internalized labeled substrate.

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled
ATP-[γ-32P], separation of the hydrolysis products by chromatographic methods, and quantitation of the
recovered 32P using a scintillation counter. The reaction mixture contains ATP-[γ-32P] and varying
amounts of TRICH in a suitable buffer incubated at 37°C for a suitable period of time. The reaction
is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an
aliquot of the reaction mixture is subjected to membrane or filter paper-based chromatography to
separate the reaction products. The amount of 32P liberated is counted in a scintillation counter. The
amount of radioactivity recovered is proportional to the ATPase activity of TRICH in the assay.

Lipocalin activity of TRICH is measured by ligand fluorescence enhancement
spectrofluorometry (Lin et al. (1997) Molecular Vision 3:17). Examples of ligands include retinol
(Sigma, St. Louis MO) and 16-anthryloxy-palmitic acid (16-AP) (Molecular Probes Inc., Eugene OR).
Ligand is dissolved in 100% ethanol and its concentration is estimated using known extinction
coefficients (retinol: 46,000 A/M/cm at 325 nm; 16-AP: 8,200 A/M/cm at 361 nm). A 700 μl aliquot of
1 μM TRICH in 10 mM Tris (pH 7.5), 2 mM EDTA, and 500 mM NaCl is placed in a 1 cm path
length quartz cuvette and 1 μl aliquots of ligand solution are added. Fluorescence is measured 100
seconds after each addition until readings are stable. Change in fluorescence per unit change in ligand
concentration is proportional to TRICH activity.
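
Estimating ligand concentration from an extinction coefficient is an application of the Beer-Lambert relation, c = A / (ε × l). The sketch below uses the two coefficients given above together with the stated 700 μl starting volume and 1 μl additions. The function names, the 1 cm path length for the absorbance reading, and the absorbance value of 0.46 are hypothetical choices made for illustration.

```python
# Extinction coefficients from the text, in absorbance units per M per cm.
EXTINCTION_COEFFICIENTS = {"retinol": 46_000.0, "16-AP": 8_200.0}


def stock_concentration_molar(absorbance, ligand, path_length_cm=1.0):
    """Beer-Lambert estimate of ligand concentration in mol/L."""
    epsilon = EXTINCTION_COEFFICIENTS[ligand]
    return absorbance / (epsilon * path_length_cm)


def ligand_in_cuvette_molar(stock_molar, n_additions,
                            aliquot_ul=1.0, start_volume_ul=700.0):
    """Ligand concentration in the cuvette after n equal additions."""
    added_ul = n_additions * aliquot_ul
    return stock_molar * added_ul / (start_volume_ul + added_ul)


# Hypothetical reading: absorbance 0.46 at 325 nm for a retinol solution.
retinol_stock = stock_concentration_molar(0.46, "retinol")
after_five = ligand_in_cuvette_molar(retinol_stock, 5)
```

A concentrated stock would normally be diluted before the absorbance reading and the dilution factor applied afterwards; that step is omitted here.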

XIX. Identification of TRICH Agonists and Antagonists 

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using
patch clamp methods well known in the art or as described in Example XVIII. Alternatively, ion
channel activity is assayed using fluorescent techniques that measure ion flux across the cell
membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening using
microplates. Changes in internal ion concentration are measured using fluorescent dyes such as the
Ca2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the Cl- indicator
MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric plate reading
system (Molecular Devices). In a more generic version of this assay, changes in membrane potential
caused by ionic flux across the plasma membrane are measured using oxonol dyes such as DiBAC4
(Molecular Probes). DiBAC4 equilibrates between the extracellular solution and cellular sites
according to the cellular membrane potential. The dye's fluorescence intensity is 20-fold greater
when bound to hydrophobic intracellular sites, allowing detection of DiBAC4 entry into the cell
(Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). Candidate agonists or
antagonists may be selected from known ion channel agonists or antagonists, peptide libraries, or
combinatorial chemical libraries.

Various modifications and variations of the described compositions, methods, and
systems of the invention will be apparent to those skilled in the art without departing from the scope
and spirit of the invention. It will be appreciated that the invention provides novel and useful proteins,
and their encoding polynucleotides, which can be used in the drug discovery process, as well as
methods for using these compositions for the detection, diagnosis, and treatment of diseases and
conditions. Although the invention has been described in connection with certain embodiments, it
should be understood that the invention as claimed should not be unduly limited to such specific
embodiments. Nor should the description of such embodiments be considered exhaustive or limit the
invention to the precise forms disclosed. Furthermore, elements from one embodiment can be readily
recombined with elements from one or more other embodiments. Such combinations can form a
number of embodiments within the scope of the invention. It is intended that the scope of the
invention be defined by the following claims and their equivalents.

What is claimed is:

1. An isolated polypeptide selected from the group consisting of:

a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-59,

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:1, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:10-11, SEQ ID NO:20-21, SEQ ID
NO:23, SEQ ID NO:25-29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID
NO:38-40, SEQ ID NO:43-45, SEQ ID NO:49-51, and SEQ ID NO:56,

c) a polypeptide comprising a naturally occurring amino acid sequence at least 91%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:3 and SEQ ID NO:54,

d) a polypeptide comprising a naturally occurring amino acid sequence at least 92%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:19 and SEQ ID NO:22,

e) a polypeptide comprising a naturally occurring amino acid sequence at least 93%
identical to the amino acid sequence of SEQ ID NO:46,

f) a polypeptide comprising a naturally occurring amino acid sequence at least 94%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:14-15,

g) a polypeptide comprising a naturally occurring amino acid sequence at least 95%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:13, SEQ ID NO:18, SEQ ID NO:36-37 and SEQ ID NO:52,

h) a polypeptide comprising a naturally occurring amino acid sequence at least 96%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:2 and SEQ ID NO:16,

i) a polypeptide comprising a naturally occurring amino acid sequence at least 97%
identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:30, SEQ ID NO:32 and SEQ ID NO:57-58,

j) a polypeptide comprising a naturally occurring amino acid sequence at least 98%
identical to the amino acid sequence of SEQ ID NO:59,

k) a polypeptide comprising a naturally occurring amino acid sequence at least 99%
identical to the amino acid sequence of SEQ ID NO:12,

l) a polypeptide consisting essentially of a naturally occurring amino acid sequence at
least 90% identical to an amino acid sequence selected from the group consisting of
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:34, SEQ ID NO:41-42,
SEQ ID NO:47, and SEQ ID NO:55,

m) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59, and

n) an immunogenic fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-59.

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-59.

3. An isolated polynucleotide encoding a polypeptide of claim 1.

4. An isolated polynucleotide encoding a polypeptide of claim 2.

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from
the group consisting of SEQ ID NO:60-118.

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein
said cell is transformed with a recombinant polynucleotide, and said recombinant
polynucleotide comprises a promoter sequence operably linked to a polynucleotide
encoding the polypeptide of claim 1, and

b) recovering the polypeptide so expressed.

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence selected
from the group consisting of SEQ ID NO:1-59.

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of:

a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:60-118,

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:60, SEQ ID NO:65, SEQ ID NO:67, SEQ ID NO:73, SEQ ID NO:76, SEQ
ID NO:78-79, SEQ ID NO:81, SEQ ID NO:83-85, SEQ ID NO:90-92, SEQ ID
NO:108 and SEQ ID NO:113,

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
91% identical to the polynucleotide sequence of SEQ ID NO:70,

d) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
92% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:71 and SEQ ID NO:87,

e) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
93% identical to the polynucleotide sequence of SEQ ID NO:115,

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
94% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:68 and SEQ ID NO:117,

g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
95% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:63, SEQ ID NO:66 and SEQ ID NO:118,

h) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
96% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:64 and SEQ ID NO:75,

i) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
97% identical to the polynucleotide sequence of SEQ ID NO:106,

j) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
98% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:80, SEQ ID NO:88, SEQ ID NO:95, SEQ ID NO:109 and SEQ ID NO:116,

k) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
99% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:62, SEQ ID NO:82, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:96, SEQ
ID NO:99 and SEQ ID NO:104,

l) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ
ID NO:61, SEQ ID NO:69, SEQ ID NO:72, SEQ ID NO:74, SEQ ID NO:93-94, SEQ
ID NO:97-98, SEQ ID NO:100-103, SEQ ID NO:105, SEQ ID NO:107, SEQ ID
NO:110-112 and SEQ ID NO:114,

m) a polynucleotide complementary to a polynucleotide of a),
n) a polynucleotide complementary to a polynucleotide of b),
o) a polynucleotide complementary to a polynucleotide of c),
p) a polynucleotide complementary to a polynucleotide of d),
q) a polynucleotide complementary to a polynucleotide of e),
r) a polynucleotide complementary to a polynucleotide of f),
s) a polynucleotide complementary to a polynucleotide of g),
t) a polynucleotide complementary to a polynucleotide of h),
u) a polynucleotide complementary to a polynucleotide of i),
v) a polynucleotide complementary to a polynucleotide of j),
w) a polynucleotide complementary to a polynucleotide of k),
x) a polynucleotide complementary to a polynucleotide of l), and
y) an RNA equivalent of a)-x).

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a
polynucleotide of claim 12.

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 12, the method comprising:

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides
comprising a sequence complementary to said target polynucleotide in the sample, and
which probe specifically hybridizes to said target polynucleotide, under conditions
whereby a hybridization complex is formed between said probe and said target
polynucleotide or fragments thereof, and

b) detecting the presence or absence of said hybridization complex, and, optionally, if
present, the amount thereof.

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides.

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 12, the method comprising:

a) amplifying said target polynucleotide or fragment thereof using polymerase chain
reaction amplification, and

b) detecting the presence or absence of said amplified target polynucleotide or fragment
thereof, and, optionally, if present, the amount thereof.

17. A composition comprising a polypeptide of claim 1 and a pharmaceutical^ acceptable 
excipient. 

25 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-59.

19. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment the composition of
claim 17.

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 


21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient.

22. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment a composition of
claim 21.

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting antagonist activity in the sample.

24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a pharmaceutically acceptable excipient.


25. A method for treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim
1, the method comprising:

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 






27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising:

a) combining the polypeptide of claim 1 with at least one test compound under conditions
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1. 
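
By way of illustration only, and without limiting the claim, the comparison recited in step c) of claim 27 can be sketched in Python. The function names and the 20% threshold are hypothetical and are not taken from the specification; any suitable activity measure and decision criterion may be substituted.

```python
def modulation_ratio(activity_with: float, activity_without: float) -> float:
    """Activity measured with the test compound divided by activity without it."""
    if activity_without <= 0:
        raise ValueError("baseline activity must be positive")
    return activity_with / activity_without


def is_modulator(activity_with: float, activity_without: float,
                 threshold: float = 0.2) -> bool:
    """Flag a compound when activity changes by at least `threshold` (hypothetical cutoff).

    A ratio above 1 suggests increased activity; a ratio below 1 suggests decreased activity.
    """
    ratio = modulation_ratio(activity_with, activity_without)
    return abs(ratio - 1.0) >= threshold
```

In practice the two activity values would come from replicate measurements, and a statistical test would normally replace the fixed cutoff assumed here.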

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising:

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 






WO 03/083085 



PCT/US03/09797 



30. A method for a diagnostic test for a condition or disease associated with the expression of 
TRICH in a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions suitable 
for the antibody to bind the polypeptide and form an antibody:polypeptide complex,
and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 



31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody,

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or

e) a humanized antibody. 


32. A composition comprising an antibody of claim 11 and an acceptable excipient. 



33. A method of diagnosing a condition or disease associated with the expression of TRICH
in a subject, comprising administering to said subject an effective amount of the composition of claim
32.

34. A composition of claim 32, further comprising a label. 



35. A method of diagnosing a condition or disease associated with the expression of TRICH
in a subject, comprising administering to said subject an effective amount of the composition of claim
34.



36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim
11, the method comprising:

a) immunizing an animal with a polypeptide consisting of an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59, or an immunogenic fragment
thereof, under conditions to elicit an antibody response,

b) isolating antibodies from the animal, and

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal
antibody which specifically binds to a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1-59.

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-59, or an immunogenic fragment
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-59.

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab expression
library.

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-59 in a sample, the method comprising:

a) incubating the antibody of claim 11 with the sample under conditions to allow specific
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-59 in the sample.



45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-59 from a sample, the method comprising:

a) incubating the antibody of claim 11 with the sample under conditions to allow specific
binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO:1-59.

46. A microarray wherein at least one element of the microarray is a polynucleotide of claim
13.



47. A method of generating an expression profile of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled polynucleotides 
of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 



48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 



49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide.

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide.
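
For illustration only, the complementarity recited in claims 49 to 51 can be checked computationally. The Python sketch below assumes DNA sequences written 5' to 3' using only the letters A, C, G and T, and it treats "completely complementary" as perfect Watson-Crick pairing over a contiguous stretch. It does not model hybridization conditions, and the function names are hypothetical.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_complementary_run(probe: str, target: str) -> int:
    """Length of the longest stretch of the probe that is perfectly
    complementary to a contiguous stretch of the target.

    Implemented as a longest-common-substring search between the
    reverse complement of the probe and the target.
    """
    rc = reverse_complement(probe)
    target = target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(rc) + 1):
        cur = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if rc[i - 1] == target[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def has_complementary_stretch(probe: str, target: str, min_length: int = 30) -> bool:
    """True if the probe is complementary to at least `min_length` contiguous nucleotides."""
    return longest_complementary_run(probe, target) >= min_length
```

Setting `min_length` to 30 or 60 corresponds to the thresholds of claims 49 and 50, and setting it to the length of the target corresponds to claim 51.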

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence.

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 



55. An array of claim 48, wherein each distinct physical location on the substrate contains
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 
another distinct physical location on the substrate. 


56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1.

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2.

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3.

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 


61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7.

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8.

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10.

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11.

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15.

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16.

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18.

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24.

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25.

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

88. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:33. 

89. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:34. 

90. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:35. 

91. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:36.

92. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:37. 

93. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:38. 

94. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:39. 

95. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:40. 

96. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:41.

97. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:42.

98. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:43. 

99. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:44. 

100. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:45. 

101. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:46. 

102. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:47. 

103. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:48. 

104. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:49. 

105. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:50.

106. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:51. 

107. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:52. 

108. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:53. 

109. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:54. 

110. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:55. 

111. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:56. 

112. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:57.

113. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:58.

114. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:59.

115. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:60.

116. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:61.

117. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:62.

118. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:63.

119. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:64.

120. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:65.

121. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:66.

122. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:67.

123. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:68.

124. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:69.

125. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:70.

126. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:71.

127. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:72.

128. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:73.

129. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:74.

130. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:75.

131. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:76.

132. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:77.

133. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:78.

134. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:79.

135. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:80.


136. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:81.

137. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:82.

138. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:83.

139. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:84.

140. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:85.

141. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:86.

142. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:87.

143. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:88.

144. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:89.

145. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:90.

146. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:91.

147. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:92.

148. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:93.

149. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:94.

150. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:95.

151. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:96.

152. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:97.

153. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:98.

154. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:99.

155. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:100.

156. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:101.

157. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:102.

158. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:103.

159. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:104.

160. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:105.

161. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:106.

162. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:107.

163. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:108.

164. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:109.

165. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:110.

166. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:111.

167. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:112.

168. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:113.

169. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:114.

170. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:115.

171. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:116.

172. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:117.

173. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:118.

<110> INCYTE CORPORATION
MARQUIS, Joseph P.
LEE, Soo Y.
EMERLING, Brooke M.
HAFALLIA, April J. A.
KHARE, Reena
KABLE, Amy E. 
RICHARDSON, Thomas W. 
SWARNAKAR, Anita 
CHAWLA, Narinder K. 
BECHA, Shanya D. 
MASON, Patricia M. 
ELLIOTT, Vicki S. 
RAMKUMAR, Jayalaxmi
GRIFFIN, Jennifer A. 
TRAN, Uyen K. 
ISON, Craig H. 
LINDQUIST, Erika A. 
JIANG, Xin 
JACKSON, Alan A. 
WILSON, Amy D. 
JIN, Pei 
CHANG, Hsin-Ru 

<120> TRANSPORTERS AND ION CHANNELS 

<130> PF-1397 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> US 60/368,840 
<151> 2002-03-28 

<150> US 60/375,637 
<151> 2002-04-26 

<160> 118 

<170> PERL Program 

<210> 1 
<211> 195 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509332CD1 

<400> 1 

Met Asp Met Ala Trp Gln Met Met Gln Leu Leu Leu Leu Ala Leu
1 5 10 15

Val Thr Ala Ala Gly Ser Ala Gln Pro Arg Ser Ala Arg Ala Arg
20 25 30

Thr Asp Leu Leu Asn Val Cys Met Asn Ala Lys His His Lys Thr
35 40 45

Gln Pro Ser Pro Glu Asp Glu Leu Tyr Gly Gln Cys Ser Pro Trp
50 55 60

Lys Lys Asn Ala Cys Cys Thr Ala Ser Thr Ser Gln Glu Leu His
65 70 75

Lys Asp Thr Ser Arg Leu Tyr Asn Phe Asn Trp Asp His Cys Glu
80 85 90

Arg Trp Trp Glu Asp Cys Arg Thr Ser Tyr Thr Cys Lys Ser Asn
95 100 105

Trp His Lys Gly Trp Asn Trp Thr Ser Gly Ile Asn Glu Cys Pro
110 115 120

Ala Gly Ala Leu Cys Ser Thr Phe Glu Ser Tyr Phe Pro Thr Pro
125 130 135

Ala Ala Leu Cys Glu Gly Leu Trp Ser His Ser Phe Lys Val Ser
140 145 150

Asn Tyr Ser Arg Gly Ser Gly Arg Cys Ile Gln Met Trp Phe Asp
155 160 165

Ser Ala Gln Gly Asn Pro Asn Glu Glu Val Ala Lys Phe Tyr Ala
170 175 180

Ala Ala Met Asn Ala Gly Ala Pro Ser Arg Gly Ile Ile Asp Ser
185 190 195
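
For illustration only, an entry of this listing can be checked mechanically against its declared length (the <211> field). The Python sketch below assumes that residues are given in the standard three-letter code and that position-number lines contain only digits. The function names are hypothetical and are not part of the listing.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def parse_residues(lines):
    """Collect three-letter residue codes, skipping position numbers.

    Raises ValueError on any token that is neither a number nor a
    recognized residue code, which helps catch transcription errors.
    """
    residues = []
    for line in lines:
        for token in line.split():
            if token in THREE_TO_ONE:
                residues.append(token)
            elif not token.isdigit():
                raise ValueError(f"unrecognized token: {token!r}")
    return residues


def to_one_letter(residues):
    """Convert a list of three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[r] for r in residues)


def matches_declared_length(lines, declared_length):
    """Compare the number of parsed residues with the <211> value."""
    return len(parse_residues(lines)) == declared_length
```

For an entry declared with <211> 195, `matches_declared_length(entry_lines, 195)` should hold when every residue line has been transcribed correctly.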



<210> 2 
<211> 138 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509102CD1 

<400> 2 

Met Gly Pro Ser Cys Pro Val Phe Leu Ser Phe Thr Lys Leu Gly
1 5 10 15

Leu Trp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu Ala Lys
20 25 30

Arg Pro Pro Pro Arg Ala Pro Gly Asp Pro Leu Ser Ser Pro Ser
35 40 45

Pro Thr Ala Leu Pro Gln Gly Gly Ser His Thr Glu Thr Glu Asp
50 55 60

Arg Leu Phe Lys His Leu Phe Arg Gly Tyr Asn Arg Trp Ala Arg
65 70 75

Pro Val Pro Asn Thr Ser Asp Val Asp Glu Lys Asn Gln Met Met
80 85 90

Thr Thr Asn Val Trp Leu Lys Gln Glu Trp Ser Asp Tyr Lys Leu
95 100 105

Arg Trp Asn Pro Thr Asp Phe Gly Asn Ile Thr Ser Leu Arg Val
110 115 120

Pro Ser Glu Met Ile Trp Ile Pro Asp Ile Val Leu Tyr Asn Lys
125 130 135

Thr Ala Arg

<210> 3
<211> 355
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509132CD1

<400> 3

Met Ser Trp Arg Cys Trp Gly Ala Ala Ser Trp Ala Trp Pro Met
1 5 10 15

Leu Leu Pro Pro Met Lys Cys Ser Ser Leu Asp Asp Ser Ser Leu
20 25 30

Ala Pro Thr Gln Val Leu Gly Leu Glu Ser Leu Leu Gly Thr Ala
35 40 45

Ser Leu Trp Pro Leu Leu Leu Gly Leu Thr Val Leu Pro Ala Leu
50 55 60

Leu Gln Leu Val Leu Leu Pro Phe Cys Pro Glu Ser Pro Arg Tyr
65 70 75

Leu Tyr Ile Ile Gln Asn Leu Glu Gly Pro Ala Arg Lys Ser Leu
80 85 90

Lys Arg Leu Thr Gly Trp Ala Asp Val Ser Gly Val Leu Ala Glu
95 100 105

Leu Lys Asp Glu Lys Arg Lys Leu Glu Arg Glu Arg Pro Leu Ser
110 115 120

Leu Leu Gln Leu Leu Gly Ser Arg Thr His Arg Gln Pro Leu Ile
125 130 135

Ile Ala Val Val Leu Gln Leu Ser Gln Gln Leu Ser Gly Ile Asn
140 145 150

Ala Val Phe Tyr Tyr Ser Thr Ser Ile Phe Glu Thr Ala Gly Val
155 160 165

Gly Gln Pro Ala Tyr Ala Thr Ile Gly Ala Gly Val Val Asn Thr
170 175 180

Val Phe Thr Leu Val Ser Val Leu Leu Val Glu Arg Ala Gly Arg
185 190 195

Arg Thr Leu His Leu Leu Gly Leu Ala Gly Met Cys Gly Cys Ala
200 205 210

Ile Leu Met Thr Val Ala Leu Leu Leu Leu Glu Arg Val Pro Ala
215 220 225

Met Ser Tyr Val Ser Ile Val Ala Ile Phe Gly Phe Val Ala Phe
230 235 240

Phe Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe Ile Val Ala Glu
245 250 255

Leu Phe Ser Gln Gly Pro Arg Pro Ala Ala Met Ala Val Ala Gly
260 265 270

Phe Ser Asn Trp Thr Ser Asn Phe Ile Ile Gly Met Gly Phe Gln
275 280 285

Tyr Val Ala Glu Ala Met Gly Pro Tyr Val Phe Leu Leu Phe Ala
290 295 300

Val Leu Leu Leu Gly Phe Phe Ile Phe Thr Phe Leu Arg Val Pro
305 310 315

Glu Thr Arg Gly Arg Thr Phe Asp Gln Ile Ser Ala Ala Phe His
320 325 330

Arg Thr Pro Ser Leu Leu Glu Gln Glu Val Lys Pro Ser Thr Glu
335 340 345

Leu Glu Tyr Leu Gly Pro Asp Glu Asn Asp

350 355

<210> 4
<211> 380
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509136CD1

<400> 4

Met Ser Thr Lys Val Tyr Leu Asp Leu Glu Trp Thr Asp Tyr Arg
1 5 10 15

Leu Ser Trp Asp Pro Ala Glu His Asp Gly Ile Asp Ser Leu Arg
20 25 30

Ile Thr Ala Glu Ser Val Trp Leu Pro Asp Val Val Leu Leu Asn
35 40 45

Asn Asn Asp Gly Asn Phe Asp Val Ala Leu Asp Ile Ser Val Val
50 55 60

Val Ser Ser Asp Gly Ser Val Arg Trp Gln Pro Pro Gly Ile Tyr
65 70 75

Arg Ser Ser Cys Ser Ile Gln Val Thr Tyr Phe Pro Phe Asp Trp
80 85 90

Gln Asn Cys Thr Met Val Phe Ser Ser Tyr Ser Tyr Asp Ser Ser
95 100 105

Glu Val Ser Leu Gln Thr Gly Leu Gly Pro Asp Gly Gln Gly His
110 115 120

Gln Glu Ile His Ile His Glu Gly Thr Phe Ile Glu Asn Gly Gln
125 130 135

Trp Glu Ile Ile His Lys Pro Ser Arg Leu Ile Gln Pro Pro Gly
140 145 150

Asp Pro Arg Gly Gly Arg Glu Gly Gln Arg Gln Glu Val Ile Phe
155 160 165

Tyr Leu Ile Ile Arg Arg Lys Pro Leu Phe Tyr Leu Val Asn Val
170 175 180

Ile Ala Pro Cys Ile Leu Ile Thr Leu Leu Ala Ile Phe Val Phe
185 190 195

Tyr Leu Pro Pro Asp Ala Val Ile Leu Ser Val Val Val Leu Asn
200 205 210

Leu His His Arg Ser Pro His Thr His Gln Met Pro Leu Trp Val
215 220 225

Arg Gln Ile Phe Ile His Lys Leu Pro Leu Tyr Leu Arg Leu Lys
230 235 240

Arg Pro Lys Pro Glu Arg Asp Leu Met Pro Glu Pro Pro His Cys
245 250 255

Ser Ser Pro Gly Ser Gly Trp Gly Arg Gly Thr Asp Glu Tyr Phe
260 265 270

Ile Arg Lys Pro Pro Ser Asp Phe Leu Phe Pro Lys Pro Asn Arg
275 280 285

Phe Gln Pro Glu Leu Ser Ala Pro Asp Leu Arg Arg Phe Ile Asp
290 295 300

Gly Pro Asn Arg Ala Val Ala Leu Leu Pro Glu Leu Arg Glu Val
305 310 315

Val Ser Ser Ile Ser Tyr Ile Ala Arg Gln Leu Gln Glu Gln Glu
320 325 330

Asp His Asp Ala Leu Lys Glu Asp Trp Gln Phe Val Ala Met Val
335 340 345

Val Asp Arg Leu Phe Leu Trp Thr Phe Ile Ile Phe Thr Ser Val
350 355 360

Gly Thr Leu Val Ile Phe Leu Asp Ala Thr Tyr His Leu Pro Pro
365 370 375

Pro Asp Pro Phe Pro
380

<210> 5 

<211> 375 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7509178CD1 

<400> 5 

Met Glu Pro Trp Pro Leu Leu Leu Leu Phe Ser Leu Cys Ser Ala
1 5 10 15

Gly Leu Val Leu Gly Ser Glu His Glu Thr Arg Leu Val Ala Lys
20 25 30

Leu Phe Lys Asp Tyr Ser Ser Val Val Arg Pro Val Glu Asp His
35 40 45

Arg Gln Val Val Glu Val Thr Val Gly Leu Gln Leu Ile Gln Leu
50 55 60

Ile Asn Val Asp Glu Val Asn Gln Ile Val Thr Thr Asn Val Arg
65 70 75

Leu Lys Gln Asn Cys Ser Met Lys Leu Gly Thr Trp Thr Tyr Asp
80 85 90

Gly Ser Val Val Ala Ile Asn Pro Glu Ser Asp Gln Pro Asp Leu
95 100 105

Ser Asn Phe Met Glu Ser Gly Glu Trp Val Ile Lys Glu Ser Arg
110 115 120

Gly Trp Lys His Ser Val Thr Tyr Ser Cys Cys Pro Asp Thr Pro
125 130 135

Tyr Leu Asp Ile Thr Tyr His Phe Val Met Gln Arg Leu Pro Leu
140 145 150

Tyr Phe Ile Val Asn Val Ile Ile Pro Cys Leu Leu Phe Ser Phe
155 160 165

Leu Thr Gly Leu Val Phe Tyr Leu Pro Thr Asp Ser Gly Glu Lys
170 175 180

Met Thr Leu Ser Ile Ser Val Leu Leu Ser Leu Thr Val Phe Leu
185 190 195

Leu Val Ile Val Glu Leu Ile Pro Ser Thr Ser Ser Ala Val Pro
200 205 210

Leu Ile Gly Lys Tyr Met Leu Phe Thr Met Val Phe Val Ile Ala
215 220 225

Ser Ile Ile Ile Thr Val Ile Val Ile Asn Thr His His Arg Ser
230 235 240

Pro Ser Thr His Val Met Pro Asn Trp Val Arg Lys Val Phe Ile
245 250 255

Asp Thr Ile Pro Asn Ile Met Phe Phe Ser Thr Met Lys Arg Pro
260 265 270

Ser Arg Glu Lys Gln Asp Lys Lys Ile Phe Thr Glu Asp Ile Asp
275 280 285

Ile Ser Asp Ile Ser Gly Lys Pro Gly Pro Pro Pro Met Gly Phe
290 295 300

His Ser Pro Leu Ile Lys His Pro Glu Val Lys Ser Ala Ile Glu
305 310 315

Gly Ile Lys Tyr Ile Ala Glu Thr Met Lys Ser Asp Gln Glu Ser
320 325 330

Asn Asn Ala Ala Ala Glu Trp Lys Tyr Val Ala Met Val Met Asp
335 340 345

His Ile Leu Leu Gly Val Phe Met Leu Val Cys Ile Ile Gly Thr
350 355 360

Leu Ala Val Phe Ala Gly Arg Leu Ile Glu Leu Asn Gln Gln Gly

365 370 375

<210> 6
<211> 153
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509214CD1

<400> 6

Met Ala Pro Pro Trp Val Pro Ala Met Gly Phe Thr Leu Ala Pro
1 5 10 15

Ser His Gly Val Arg Leu Leu Pro Gly Leu Glu Arg Ala Gly Arg
20 25 30

Leu His Arg Glu Gly Cys Gly Ser Pro Gly Pro Leu His Trp Ala
35 40 45

Ala Gly Pro Glu Leu Gly Met Ala Pro His Leu Leu Trp Cys Pro
50 55 60

Thr Asn Gly Leu Gly Leu Gly Gly Ser Pro Ala Gly Gln Trp Gly
65 70 75

Gly Gly Ser His Tyr Arg Gly Leu Val Pro Gly Glu Pro Ala Gly
80 85 90

Arg Pro Pro Ala Leu Pro Leu Pro Gly Leu Ala Gly Leu Arg Asp
95 100 105

His Thr Gln Leu Leu Arg Met Ala Gly Gln Pro Trp Leu Ala Trp
110 115 120

Gly Thr Ala Ala Ala Arg Val Ser Ala Arg Pro Thr Arg Asp Cys
125 130 135

Ser Cys Thr Ser Arg Cys His His Ala Cys Asp Val Val Ala Val
140 145 150

Thr Leu Ser

<210> 7
<211> 369
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509244CD1

<400> 7

Met Ser Thr Lys Val Tyr Leu Asp Leu Glu Trp Thr Asp Tyr Arg
1 5 10 15

Leu Ser Trp Asp Pro Ala Glu His Asp Gly Ile Asp Ser Leu Arg
20 25 30

Ile Thr Ala Glu Ser Val Trp Leu Pro Asp Val Val Leu Leu Asn
35 40 45

Asn Asn Asp Gly Asn Phe Asp Val Ala Leu Asp Ile Ser Val Val
50 55 60

Val Ser Ser Asp Gly Ser Val Arg Trp Gln Pro Pro Gly Ile Tyr
65 70 75

Arg Ser Ser Cys Ser Ile Gln Val Thr Tyr Phe Pro Phe Asp Trp
80 85 90

Gln Asn Cys Thr Met Val Phe Ser Ser Tyr Ser Tyr Asp Ser Ser
95 100 105

Glu Val Ser Leu Gln Thr Gly Leu Gly Pro Asp Gly Gln Gly His
110 115 120

Gln Glu Ile His Ile His Glu Gly Thr Phe Ile Glu Asn Gly Gln
125 130 135

Trp Glu Ile Ile His Lys Pro Ser Arg Leu Ile Gln Pro Pro Gly
140 145 150

Asp Pro Arg Gly Gly Arg Glu Gly Gln Arg Gln Glu Val Ile Phe
155 160 165

Tyr Leu Ile Ile Arg Arg Lys Pro Leu Phe Tyr Leu Val Asn Val
170 175 180

Ile Ala Pro Cys Ile Leu Ile Thr Leu Leu Ala Ile Phe Val Phe
185 190 195

Tyr Leu Pro Pro Asp Ala Gly Glu Lys Met Gly Leu Ser Ile Phe
200 205 210

Ala Leu Leu Thr Leu Thr Val Phe Leu Leu Leu Leu Ala Asp Lys
215 220 225

Val Pro Glu Thr Ser Leu Ser Val Pro Ile Ile Ile Lys Tyr Leu
230 235 240

Met Phe Thr Met Val Leu Val Thr Phe Ser Val Ile Leu Ser Val
245 250 255

Val Val Leu Asn Leu His His Arg Ser Pro His Thr His Gln Met
260 265 270

Pro Leu Trp Val Arg Gln Ile Phe Ile His Lys Leu Pro Leu Tyr
275 280 285

Leu Arg Leu Lys Arg Pro Lys Pro Glu Arg Asp Leu Met Pro Glu
290 295 300

Leu Arg Glu Val Val Ser Ser Ile Ser Tyr Ile Ala Arg Gln Leu
305 310 315

Gln Glu Gln Glu Asp His Asp Ala Leu Lys Glu Asp Trp Gln Phe
320 325 330

Val Ala Met Val Val Asp Arg Leu Phe Leu Trp Thr Phe Ile Ile
335 340 345

Phe Thr Ser Val Gly Thr Leu Val Ile Phe Leu Asp Ala Thr Tyr
350 355 360

His Leu Pro Pro Pro Asp Pro Phe Pro

365 



<210> 8 

<211> 303 

<212> PRT 

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7509256CD1

<400> 8

Met Lys Phe Leu Leu Thr Thr Ala Phe Leu Ile Leu Ile Ser Leu
1 5 10 15

Trp Val Glu Glu Ala Tyr Ser Lys Glu Lys Ser Ser Lys Lys Gly
20 25 30

Lys Gly Lys Lys Lys Gln Tyr Leu Cys Pro Ser Gln Gln Ser Ala
35 40 45

Glu Asp Leu Ala Arg Val Pro Ala Asn Ser Thr Ser Asn Ile Leu
50 55 60

Asn Arg Leu Leu Val Ser Tyr Asp Pro Arg Ile Arg Pro Asn Phe
65 70 75

Lys Gly Ile Pro Val Asp Val Val Val Asn Ile Phe Ile Asn Ser
80 85 90

Phe Gly Ser Ile Gln Glu Thr Thr Met Asp Tyr Arg Val Asn Ile
95 100 105

Phe Leu Arg Gln Lys Trp Asn Asp Pro Arg Leu Lys Leu Pro Ser
110 115 120

Asp Phe Arg Gly Ser Asp Ala Leu Thr Val Asp Pro Thr Met Tyr
125 130 135

Lys Cys Leu Trp Lys Pro Asp Leu Phe Phe Ala Asn Glu Lys Ser
140 145 150

Ala Asn Phe His Asp Val Thr Gln Glu Asn Ile Leu Leu Phe Ile
155 160 165

Phe Arg Asp Gly Asp Val Leu Val Ser Met Arg Leu Ser Ile Thr
170 175 180

Leu Ser Cys Pro Leu Asp Leu Thr Leu Phe Pro Met Asp Thr Gln
185 190 195

Arg Cys Lys Met Gln Leu Glu Ser Phe Gly Tyr Thr Thr Asp Asp
200 205 210

Leu Arg Phe Ile Trp Gln Ser Gly Asp Pro Val Gln Leu Glu Lys
215 220 225

Ile Ala Leu Pro Gln Phe Asp Ile Lys Lys Glu Asp Ile Glu Tyr
230 235 240

Gly Asn Cys Thr Lys Tyr Tyr Lys Gly Thr Gly Tyr Tyr Thr Cys
245 250 255

Val Glu Val Ile Phe Thr Leu Arg Arg Gln Val Gly Phe Tyr Met
260 265 270

Met Gly Val Tyr Ala Pro Thr Leu Leu Ile Val Val Leu Ser Trp
275 280 285

Leu Ser Phe Trp Ile Asn Pro Asp Ala Ser Ala Ala Arg Val Pro
290 295 300

Leu Gly Trp



<210> 9 

<211> 370 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7509395CD1

<400> 9

Met Glu Pro Trp Pro Leu Leu Leu Leu Phe Ser Leu Cys Ser Ala
1 5 10 15

Gly Leu Val Leu Gly Ser Glu His Glu Thr Arg Leu Val Ala Lys
20 25 30

Leu Phe Lys Asp Tyr Ser Ser Val Val Arg Pro Val Glu Asp His
35 40 45

Arg Gln Val Val Glu Val Thr Val Gly Leu Gln Leu Ile Gln Leu
50 55 60

Ile Asn Val Asp Glu Val Asn Gln Ile Val Thr Thr Asn Asn Cys
65 70 75

Ser Met Lys Leu Gly Thr Trp Thr Tyr Asp Gly Ser Val Val Ala
80 85 90

Ile Asn Pro Glu Ser Asp Gln Pro Asp Leu Ser Asn Phe Met Glu
95 100 105

Ser Gly Glu Trp Val Ile Lys Glu Ser Arg Gly Trp Lys His Ser
110 115 120

Val Thr Tyr Ser Cys Cys Pro Asp Thr Pro Tyr Leu Asp Ile Thr
125 130 135

Tyr His Phe Val Met Gln Arg Leu Pro Leu Tyr Phe Ile Val Asn
140 145 150

Val Ile Ile Pro Cys Leu Leu Phe Ser Phe Leu Thr Gly Leu Val
155 160 165

Phe Tyr Leu Pro Thr Asp Ser Gly Glu Lys Met Thr Leu Ser Ile
170 175 180

Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val Ile Val Glu
185 190 195

Leu Ile Pro Ser Thr Ser Ser Ala Val Pro Leu Ile Gly Lys Tyr
200 205 210

Met Leu Phe Thr Met Val Phe Val Ile Ala Ser Ile Ile Ile Thr
215 220 225

Val Ile Val Ile Asn Thr His His Arg Ser Pro Ser Thr His Val
230 235 240

Met Pro Asn Trp Val Arg Lys Val Phe Ile Asp Thr Ile Pro Asn
245 250 255

Ile Met Phe Phe Ser Thr Met Lys Arg Pro Ser Arg Glu Lys Gln
260 265 270

Asp Lys Lys Ile Phe Thr Glu Asp Ile Asp Ile Ser Asp Ile Ser
275 280 285

Gly Lys Pro Gly Pro Pro Pro Met Gly Phe His Ser Pro Leu Ile
290 295 300

Lys His Pro Glu Val Lys Ser Ala Ile Glu Gly Ile Lys Tyr Ile
305 310 315

Ala Glu Thr Met Lys Ser Asp Gln Glu Ser Asn Asn Ala Ala Ala
320 325 330

Glu Trp Lys Tyr Val Ala Met Val Met Asp His Ile Leu Leu Gly
335 340 345

Val Phe Met Leu Val Cys Ile Ile Gly Thr Leu Ala Val Phe Ala
350 355 360

Gly Arg Leu Ile Glu Leu Asn Gln Gln Gly
365 370



<210> 10 

<211> 283 

<212> PRT 

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7503287CD1

<400> 10

Met Glu Leu Lys Ala Glu Glu Glu Glu Val Gly Gly Val Gln Pro
1 5 10 15

Val Ser Ile Gln Ala Phe Ala Ser Ser Ser Thr Leu His Gly Leu
20 25 30

Ala His Ile Phe Ser Tyr Glu Arg Leu Ser Leu Lys Arg Ala Leu
35 40 45

Trp Ala Leu Cys Phe Leu Gly Ser Leu Ala Val Leu Leu Cys Val
50 55 60

Cys Thr Glu Arg Val Gln Tyr Tyr Phe His Tyr His His Val Thr
65 70 75

Lys Leu Asp Glu Val Ala Ala Ser Gln Leu Thr Phe Pro Ala Val
80 85 90

Thr Leu Cys Asn Leu Asn Glu Phe Arg Phe Ser Gln Val Ser Lys
95 100 105

Asn Asp Leu Tyr His Ala Gly Glu Leu Leu Ala Leu Leu Asn Asn
110 115 120

Arg Tyr Glu Ile Pro Asp Thr Gln Met Ala Asp Glu Lys Gln Leu
125 130 135

Glu Ile Leu Gln Asp Lys Ala Asn Phe Arg Ser Phe Lys Pro Lys
140 145 150

Pro Phe Asn Met Arg Glu Phe Tyr Asp Arg Ala Gly His Asp Ile
155 160 165

Arg Asp Met Leu Leu Ser Cys His Phe Arg Gly Glu Val Cys Ser
170 175 180

Ala Glu Asp Phe Lys Val Val Phe Thr Arg Tyr Gly Lys Cys Tyr
185 190 195

Thr Phe Asn Ser Gly Arg Asp Gly Arg Pro Arg Leu Lys Thr Met
200 205 210

Lys Gly Gly Thr Gly Asn Gly Leu Glu Ile Met Leu Asp Ile Gln
215 220 225

Gln Asp Glu Tyr Leu Pro Val Trp Gly Glu Thr Asp Glu Thr Ser
230 235 240

Phe Glu Ala Gly Ile Lys Val Gln Ile Phe Pro Leu Val Cys Gly
245 250 255

Lys Glu Gly Val Leu Thr Ile Glu Ser Ser Leu Cys Leu Tyr Pro
260 265 270

Ile Leu Phe Thr Phe Asn Lys Thr Asn Leu Lys Lys Asn
275 280

<210> 11 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7503320CD1 

<400> 11 

Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser
1 5 10 15

Leu Leu His Val Ser Leu Gln Gly Glu Phe Gln Arg Lys Leu Tyr
20 25 30

Lys Glu Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala
35 40 45

Asn Asp Ser Gln Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu
50 55 60

Gln Ile Met Asp Val Asp Glu Lys Asn Gln Val Leu Thr Thr Thr
65 70 75

Thr Pro Thr Gly Ala Arg Cys Pro Ser Gly Pro Glu Ser Ser Phe
80 85 90



<210> 12 
<211> 549 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7503335CD1 

<400> 12 

Met Pro Ala Cys Cys Ser Cys Ser Asp Val Phe Gln Tyr Glu Thr
1 5 10 15

Asn Lys Val Thr Arg Ile Gln Ser Met Asn Tyr Gly Thr Ile Lys
20 25 30

Trp Phe Phe His Val Ile Ile Phe Ser Tyr Val Cys Phe Ala Leu
35 40 45

Val Ser Asp Lys Leu Tyr Gln Arg Lys Glu Pro Val Ile Ser Ser
50 55 60

Val His Thr Lys Val Lys Gly Ile Ala Glu Val Lys Glu Glu Ile
65 70 75

Val Glu Asn Gly Val Lys Lys Leu Val His Ser Val Phe Asp Thr
80 85 90

Ala Asp Tyr Thr Phe Pro Leu Gln Gly Asn Ser Phe Phe Val Met
95 100 105

Thr Asn Phe Leu Lys Thr Glu Gly Gln Glu Gln Arg Leu Cys Pro
110 115 120

Glu Tyr Pro Thr Arg Arg Thr Leu Cys Ser Ser Asp Arg Gly Cys
125 130 135

Lys Lys Gly Trp Met Asp Pro Gln Ser Lys Gly Ile Gln Thr Gly
140 145 150

Arg Cys Val Val His Glu Gly Asn Gln Lys Thr Cys Glu Val Ser
155 160 165

Ala Trp Cys Pro Ile Glu Ala Val Glu Glu Ala Pro Arg Pro Ala
170 175 180

Leu Leu Asn Ser Ala Glu Asn Phe Thr Val Leu Ile Lys Asn Asn
185 190 195

Ile Asp Phe Pro Gly His Asn Tyr Thr Thr Arg Asn Ile Leu Pro
200 205 210

Gly Leu Asn Ile Thr Cys Thr Phe His Lys Thr Gln Asn Pro Gln
215 220 225

Cys Pro Ile Phe Arg Leu Gly Asp Ile Phe Arg Glu Gln Ala Ile
230 235 240

Ile Phe Gln Met Trp Gln Phe Arg Tyr Ala Lys Tyr Tyr Lys Glu
245 250 255

Asn Asn Val Glu Lys Arg Thr Leu Ile Lys Val Phe Gly Ile Arg
260 265 270

Phe Asp Ile Leu Val Phe Gly Thr Gly Gly Lys Phe Asp Ile Ile
275 280 285

Gln Leu Val Val Tyr Ile Gly Ser Thr Leu Ser Tyr Phe Gly Leu
290 295 300

Ala Ala Val Phe Ile Asp Phe Leu Ile Asp Thr Tyr Ser Ser Asn
305 310 315

Cys Cys Arg Ser His Ile Tyr Pro Trp Cys Lys Cys Cys Gln Pro
320 325 330

Cys Val Val Asn Glu Tyr Tyr Tyr Arg Lys Lys Cys Glu Ser Ile
335 340 345

Val Glu Pro Lys Pro Thr Leu Lys Tyr Val Ser Phe Val Asp Glu
350 355 360

Ser His Ile Arg Met Val Asn Gln Gln Leu Leu Gly Arg Ser Leu
365 370 375

Gln Asp Val Lys Gly Gln Glu Val Pro Arg Pro Ala Met Asp Phe
380 385 390

Thr Asp Leu Ser Arg Leu Pro Leu Ala Leu His Asp Thr Pro Pro
395 400 405

Ile Pro Gly Gln Pro Glu Glu Ile Gln Leu Leu Arg Lys Glu Ala
410 415 420

Thr Pro Arg Ser Arg Asp Ser Pro Val Trp Cys Gln Cys Gly Ser
425 430 435

Cys Leu Pro Ser Gln Leu Pro Glu Ser His Arg Cys Leu Glu Glu
440 445 450

Leu Cys Cys Arg Lys Lys Pro Gly Ala Cys Ile Thr Thr Ser Glu
455 460 465

Leu Phe Arg Lys Leu Val Leu Ser Arg His Val Leu Gln Phe Leu
470 475 480

Leu Leu Tyr Gln Glu Pro Leu Leu Ala Leu Asp Val Asp Ser Thr
485 490 495

Asn Ser Arg Leu Arg His Cys Ala Tyr Arg Cys Tyr Ala Thr Trp
500 505 510

Arg Phe Gly Ser Gln Asp Met Ala Asp Phe Ala Ile Leu Pro Ser
515 520 525

Cys Cys Arg Trp Arg Ile Arg Lys Glu Phe Pro Lys Ser Glu Gly
530 535 540

Gln Tyr Ser Gly Phe Lys Ser Pro Tyr
545



<210> 13 

<211> 246 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7503952CD1 



<400> 13 

Met Leu Ser Ser Val Met Ala Pro Leu Trp Ala Cys Ile Leu Val
1 5 10 15

Ala Ala Gly Ile Leu Ala Thr Asp Thr His His Pro Gln Asp Ser
20 25 30

Ala Leu Tyr His Leu Ser Lys Gln Leu Leu Gln Lys Tyr His Lys
35 40 45

Glu Val Arg Pro Val Tyr Asn Trp Thr Lys Ala Thr Thr Val Tyr
50 55 60

Leu Asp Leu Phe Val His Ala Ile Leu Asp Val Asp Ala Glu Asn
65 70 75

Gln Ile Leu Lys Thr Ser Val Trp Tyr Gln Glu Val Trp Asn Asp
80 85 90

Glu Phe Leu Ser Trp Asn Ser Ser Met Phe Asp Glu Ile Arg Glu
95 100 105

Ile Ser Leu Pro Leu Ser Ala Ile Trp Ala Pro Asp Ile Ile Ile
110 115 120

Asn Glu Phe Val Asp Ile Glu Arg Tyr Pro Asp Leu Pro Tyr Val
125 130 135

Tyr Val Asn Ser Ser Gly Thr Ile Glu Asn Tyr Lys Pro Ile Gln
140 145 150

Val Val Ser Ala Cys Ser Leu Glu Thr Tyr Ala Phe Pro Phe Asp
155 160 165

Val Gln Asn Cys Ser Leu Thr Phe Lys Ser Ile Leu His Thr Val
170 175 180

Glu Asp Val Asp Leu Ala Phe Leu Arg Ser Pro Glu Asp Ile Gln
185 190 195

His Asp Lys Lys Ala Phe Leu Asn Asp Ser Glu Trp Glu Leu Leu
200 205 210

Ser Val Ser Ser Thr Tyr Ser Ile Leu Gln Ser Ser Ala Gly Gly
215 220 225

Phe Ala Gln Ile Gln Phe Asn Gly Thr Ser Ser Pro Ser Ala Trp
230 235 240

Pro Ser Trp Phe Ser Ala
245

<210> 14 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7504530CD1

<400> 14

Met Val Gln Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys
1 5 10 15

Met Val Ser Trp Ser Val Ile Ala Lys Ile Gln Glu Ile Leu Gln
20 25 30

Arg Lys Met Val Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr
35 40 45

Val Met Met Val Phe Gly Leu Gly Ser Val Ala His Met Val Leu
50 55 60

Asn Lys Lys Tyr Gly Ser Tyr Leu Gly Val Asn Leu Gly Phe Gly
65 70 75

Phe Gly Val Thr Met Gly Val His Val Ala Gly Arg Ile Ser Gly
80 85 90

Ala His Met Asn Ala Ala Val Thr Phe Ala Asn Cys Ala Leu Gly
95 100 105

Arg Val Pro Trp Arg Lys Phe Pro Val Tyr Val Leu Gly Gln Phe
110 115 120

Leu Gly Ser Phe Leu Ala Ala Ala Thr Ile Tyr Ser Leu Phe Tyr
125 130 135

Thr Ala Ile Leu His Phe Ser Gly Gly Gln Leu Met Val Thr Gly
140 145 150

Pro Val Ala Thr Ala Gly Ile Phe Ala Thr Tyr Leu Pro Asp His
155 160 165

Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala Trp Leu Thr Gly
170 175 180

Met Leu Gln Leu Cys Leu Phe Ala Ile Thr Asp Gln Glu Asn Asn
185 190 195

Pro Ala Leu Pro Gly Thr Glu Ala Leu Val Ile Gly Ile Leu Val
200 205 210

Val Ile Ile Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala Ile
215 220 225

Asn Pro Ser Arg Asp Leu Pro Pro Arg Ile Phe Thr Phe Ile Ala
230 235 240

Gly Trp Gly Lys Gln Val Phe Arg Trp His His Leu Pro Gly Leu
245 250 255

His Trp Leu His His Pro Thr Gly Ala Pro Glu Ile Gly Gly Phe
260 265 270

Cys Gly Val



<210> 15 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509303CD1 

<400> 15 



Met Glu Gly Asn Lys Leu Glu Glu Gln Asp Ser Ser Pro Pro Gln
1 5 10 15

Ser Thr Pro Gly Leu Met Lys Gly Asn Lys Arg Glu Glu Gln Gly
20 25 30

Leu Gly Pro Glu Pro Ala Ala Pro Gln Gln Pro Thr Ala Glu Glu
35 40 45

Glu Ala Leu Ile Glu Phe His Arg Ser Tyr Arg Glu Leu Phe Glu
50 55 60

Phe Phe Cys Asn Asn Thr Thr Ile His Gly Ala Ile Arg Leu Val
65 70 75

Cys Ser Gln His Asn Arg Met Lys Thr Ala Phe Trp Ala Val Leu
80 85 90

Trp Leu Cys Thr Phe Gly Met Met Tyr Trp Gln Phe Gly Leu Leu
95 100 105

Phe Gly Glu Tyr Phe Ser Tyr Pro Val Ser Leu Asn Ile Asn Leu
110 115 120

Asn Ser Asp Lys Leu Val Phe Pro Ala Val Thr Ile Cys Thr Leu
125 130 135

Asn Pro Tyr Arg Tyr Pro Glu Ile Lys Glu Glu Leu Glu Glu Leu
140 145 150

Asp Arg Ile Thr Glu Gln Thr Leu Phe Asp Leu Tyr Lys Tyr Ser
155 160 165

Ser Phe Thr Thr Leu Val Ala Gly Ser Arg Ser Arg Arg Asp Leu
170 175 180

Arg Gly Thr Leu Pro His Pro Leu Gln Arg Leu Arg Val Pro Pro
185 190 195

Pro Pro His Gly Ala Arg Arg Ala Arg Ser Val Ala Ser Ser Leu
200 205 210

Arg Asp Asn Asn Pro Gln Val Asp Trp Lys Asp Trp Lys Ile Gly
215 220 225

Phe Gln Leu Glu Leu Leu Ser Leu Pro Pro Pro Asp Val Trp Lys
230 235 240

Leu Leu Tyr Phe Gln
245

<210> 16 

<211> 364 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509910CD1 

<400> 16 

Met Pro Ala Cys Cys Ser Cys Ser Asp Val Phe Gln Tyr Glu Thr
1 5 10 15

Asn Lys Val Thr Arg Ile Gln Ser Met Asn Tyr Gly Thr Ile Lys
20 25 30

Trp Phe Phe His Val Ile Ile Phe Ser Tyr Val Cys Phe Ala Leu
35 40 45

Val Ser Asp Lys Leu Tyr Gln Arg Lys Glu Pro Val Ile Ser Ser
50 55 60

Val His Thr Lys Val Lys Gly Ile Ala Glu Val Lys Glu Glu Ile
65 70 75

Val Glu Asn Gly Val Lys Lys Leu Val His Ser Val Phe Asp Thr
80 85 90

Ala Asp Tyr Thr Phe Pro Leu Gln Gly Asn Ser Phe Phe Val Met
95 100 105

Thr Asn Phe Leu Lys Thr Glu Gly Gln Glu Gln Arg Leu Cys Pro
110 115 120

Glu Tyr Pro Thr Arg Arg Thr Leu Cys Ser Ser Asp Arg Gly Cys
125 130 135

Lys Lys Gly Trp Met Asp Pro Gln Ser Lys Gly Ile Gln Thr Gly
140 145 150

Arg Cys Val Val His Glu Gly Asn Gln Lys Thr Cys Glu Val Ser
155 160 165

Ala Trp Cys Pro Ile Glu Ala Val Glu Glu Ala Pro Arg Pro Ala
170 175 180

Leu Leu Asn Ser Ala Glu Asn Phe Thr Val Leu Ile Lys Asn Asn
185 190 195

Ile Asp Phe Pro Gly His Asn Tyr Thr Thr Arg Asn Ile Leu Pro
200 205 210

Gly Leu Asn Ile Thr Cys Thr Phe His Lys Thr Gln Asn Pro Gln
215 220 225

Cys Pro Ile Phe Arg Leu Gly Asp Ile Phe Arg Glu Thr Gly Asp
230 235 240

Asn Phe Ser Asp Val Ala Ile Gln Gly Gly Ile Met Gly Ile Glu
245 250 255

Ile Tyr Trp Asp Cys Asn Leu Asp Arg Trp Phe His His Cys Arg
260 265 270

Pro Lys Tyr Ser Phe Arg Arg Leu Asp Asp Lys Thr Thr Asn Val
275 280 285

Ser Leu Tyr Pro Gly Tyr Asn Phe Arg Tyr Ala Lys Tyr Tyr Lys
290 295 300

Glu Asn Asn Val Glu Lys Arg Thr Leu Ile Lys Val Phe Gly Ile
305 310 315

Arg Phe Asp Ile Leu Val Phe Gly Thr Gly Gly Lys Phe Asp Ile
320 325 330

Ile Gln Leu Val Val Tyr Ile Gly Ser Thr Leu Ser Tyr Phe Gly
335 340 345

Leu Val Arg Asp Ser Leu Phe His Ala Leu Gly Lys Trp Phe Gly
350 355 360

Glu Gly Ser Asp



<210> 17 

<211> 1623 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509982CD1 

<400> 17 

Met Asn Met Lys Gln Lys Ser Val Tyr Gln Gln Thr Lys Ala Leu
1 5 10 15

Leu Cys Lys Asn Phe Leu Lys Lys Trp Arg Met Lys Arg Glu Ser
20 25 30

Leu Leu Glu Trp Gly Leu Ser Ile Leu Leu Gly Leu Cys Ile Ala
35 40 45

Leu Phe Ser Ser Ser Met Arg Asn Val Gln Phe Pro Gly Met Ala
50 55 60

Pro Gln Asn Leu Gly Arg Val Asp Lys Phe Asn Ser Ser Ser Leu
65 70 75

Met Val Val Tyr Thr Pro Ile Ser Asn Leu Thr Gln Gln Ile Met
80 85 90

Asn Lys Thr Ala Leu Ala Pro Leu Leu Lys Gly Thr Ser Val Ile
95 100 105

Gly Ala Pro Asn Lys Thr His Met Asp Glu Ile Leu Leu Glu Asn
110 115 120

Leu Pro Tyr Ala Met Gly Ile Ile Phe Asn Glu Thr Phe Ser Tyr
125 130 135

Lys Leu Ile Phe Phe Gln Gly Tyr Asn Ser Pro Leu Trp Lys Glu
140 145 150

Asp Phe Ser Ala His Cys Trp Asp Gly Tyr Gly Glu Phe Ser Cys
155 160 165

Thr Leu Thr Lys Tyr Trp Asn Arg Gly Phe Val Ala Leu Gln Thr
170 175 180

Ala Ile Asn Thr Ala Ile Ile Glu Ile Thr Thr Asn His Pro Val
185 190 195

Met Glu Glu Leu Met Ser Val Thr Ala Ile Thr Met Lys Thr Leu
200 205 210

Pro Phe Ile Thr Lys Asn Leu Leu His Asn Glu Met Phe Ile Leu
215 220 225

Phe Phe Leu Leu His Phe Ser Pro Leu Val Tyr Phe Ile Ser Leu
230 235 240

Asn Val Thr Lys Glu Arg Lys Lys Ser Lys Asn Leu Met Lys Met
245 250 255

Met Gly Leu Gln Asp Ser Ala Phe Trp Leu Ser Trp Gly Leu Ile
260 265 270

Tyr Ala Gly Phe Ile Phe Ile Ile Ser Ile Phe Ile Thr Ile Ile
275 280 285

Ile Thr Phe Thr Gln Ile Ile Val Met Thr Gly Phe Met Val Ile
290 295 300

Phe Ile Pro Phe Phe Leu Tyr Gly Leu Ser Leu Val Ala Leu Val
305 310 315

Phe Leu Leu Ser Val Leu Leu Lys Lys Ala Val Leu Thr Asn Leu
320 325 330

Val Val Phe Leu Leu Thr Leu Phe Trp Gly Cys Leu Gly Phe Thr
335 340 345

Val Phe Tyr Glu Gln Leu Pro Ser Ser Leu Glu Trp Ile Leu Asn
350 355 360

Ile Cys Ser Pro Phe Ala Phe Thr Thr Gly Met Ile Gln Ile Ile
365 370 375

Lys Leu Asp Tyr Asn Leu Asn Gly Val Ile Phe Pro Asp Pro Ser
380 385 390

Gly Asp Ser Tyr Thr Met Ile Ala Thr Phe Ser Met Leu Leu Leu
395 400 405

Asp Gly Leu Ile Tyr Leu Leu Leu Ala Leu Tyr Phe Asp Lys Ile
410 415 420

Leu Pro Tyr Gly Asp Glu Arg His Tyr Ser Pro Leu Phe Phe Leu
425 430 435

Asn Ser Ser Ser Cys Phe Gln His Gln Arg Thr Asn Ala Lys Val
440 445 450

Ile Glu Lys Glu Ile Asp Ala Glu His Pro Ser Asp Asp Tyr Phe
455 460 465

Glu Pro Val Ala Pro Glu Phe Gln Gly Lys Glu Ala Ile Arg Ile
470 475 480

Arg Asn Val Lys Lys Glu Tyr Lys Gly Lys Ser Gly Lys Val Glu
485 490 495

Ala Leu Lys Gly Leu Leu Phe Asp Ile Tyr Glu Gly Gln Ile Thr
500 505 510

Ala Ile Leu Gly His Ser Gly Ala Gly Lys Ser Ser Leu Leu Asn
515 520 525

Ile Leu Asn Gly Leu Ser Val Pro Thr Glu Gly Ser Val Thr Ile
530 535 540

Tyr Asn Lys Asn Leu Ser Glu Met Gln Asp Leu Glu Glu Ile Arg
545 550 555

Lys Ile Thr Gly Val Cys Pro Gln Phe Asn Val Gln Phe Asp Ile
560 565 570

Leu Thr Val Lys Glu Asn Leu Ser Leu Phe Ala Lys Ile Lys Gly
575 580 585

Ile His Leu Lys Glu Val Glu Gln Glu Val Gln Arg Ile Leu Leu
590 595 600

Glu Leu Asp Met Gln Asn Ile Gln Asp Asn Leu Ala Lys His Leu
605 610 615

Ser Glu Gly Gln Lys Arg Lys Leu Thr Phe Gly Ile Thr Ile Leu
620 625 630

Gly Asp Pro Gln Ile Leu Leu Leu Asp Glu Pro Thr Thr Gly Leu
635 640 645

Asp Pro Phe Ser Arg Asp Gln Val Trp Ser Leu Leu Arg Glu Arg
650 655 660

Arg Ala Asp His Val Ile Leu Phe Ser Thr Gln Ser Met Asp Glu
665 670 675

Ala Asp Ile Leu Ala Asp Arg Lys Val Ile Met Ser Asn Gly Arg
680 685 690

Leu Lys Cys Ala Gly Ser Ser Met Phe Leu Lys Arg Arg Trp Gly
695 700 705

Leu Gly Tyr His Leu Ser Leu His Arg Asn Glu Ile Cys Asn Pro
710 715 720

Glu Gln Ile Thr Ser Phe Ile Thr His His Ile Pro Asp Ala Lys
725 730 735

Leu Lys Thr Glu Asn Lys Glu Lys Leu Val Tyr Thr Leu Pro Leu
740 745 750

Glu Arg Thr Asn Thr Phe Pro Asp Leu Phe Ser Asp Leu Asp Lys
755 760 765

Cys Ser Asp Gln Gly Val Thr Gly Tyr Asp Ile Ser Met Ser Thr
770 775 780

Leu Asn Glu Val Phe Met Lys Leu Glu Gly Gln Ser Thr Ile Glu
785 790 795

Gln Gly Lys Ala Ile Cys Ile Asn Phe Glu Gln Val Glu Met Ile
800 805 810

Arg Asp Ser Glu Ser Leu Asn Glu Met Glu Leu Ala His Ser Ser
815 820 825

Phe Ser Glu Met Gln Thr Ala Val Ser Asp Met Gly Leu Trp Arg
830 835 840

Met Gln Val Phe Ala Met Ala Arg Leu Arg Phe Leu Lys Leu Lys
845 850 855

Arg Gln Thr Lys Val Leu Leu Thr Leu Leu Leu Val Phe Gly Ile
860 865 870

Ala Ile Phe Pro Leu Ile Val Glu Asn Ile Ile Tyr Ala Met Leu
875 880 885

Asn Glu Lys Ile Asp Trp Glu Phe Lys Asn Glu Leu Tyr Phe Leu
890 895 900

Ser Pro Gly Gln Leu Pro Gln Glu Pro Arg Thr Ser Leu Leu Ile
905 910 915

Ile Asn Asn Thr Glu Ser Asn Ile Glu Asp Phe Ile Lys Ser Leu
920 925 930

Lys His Gln Asn Ile Leu Leu Glu Val Asp Asp Phe Glu Asn Arg
935 940 945

Asn Gly Thr Asp Gly Leu Ser Tyr Asn Gly Ala Ile Ile Val Ser
950 955 960

Gly Lys Gln Lys Asp Tyr Arg Phe Ser Val Val Cys Asn Thr Lys
965 970 975

Arg Leu His Cys Phe Pro Ile Leu Met Asn Ile Ile Ser Asn Gly
980 985 990

Leu Leu Gln Met Phe Asn His Thr Gln His Ile Arg Ile Glu Ser
995 1000 1005

Ser Pro Phe Pro Leu Ser His Ile Gly Leu Trp Thr Gly Leu Pro
1010 1015 1020

Asp Gly Ser Phe Phe Leu Phe Leu Val Leu Cys Ser Ile Ser Pro
1025 1030 1035

Tyr Ile Thr Met Gly Ser Ile Ser Asp Tyr Lys Lys Asn Ala Lys
1040 1045 1050

Ser Gln Leu Trp Ile Ser Gly Leu Tyr Thr Ser Ala Tyr Trp Cys
1055 1060 1065

Gly Gln Ala Leu Val Asp Val Ser Phe Phe Ile Leu Ile Leu Leu
1070 1075 1080

Leu Met Tyr Leu Ile Phe Tyr Ile Glu Asn Met Gln Tyr Leu Leu
1085 1090 1095

Ile Thr Ser Gln Ile Val Phe Ala Leu Val Ile Val Thr Pro Gly
1100 1105 1110

Tyr Ala Ala Ser Leu Val Phe Phe Ile Tyr Met Ile Ser Phe Ile
1115 1120 1125

Phe Arg Lys Arg Arg Lys Asn Ser Gly Leu Trp Ser Phe Tyr Phe
1130 1135 1140

Phe Phe Ala Ser Thr Ile Met Phe Ser Ile Thr Leu Ile Asn His
1145 1150 1155

Phe Asp Leu Ser Ile Leu Ile Thr Thr Met Val Leu Val Pro Ser
1160 1165 1170

Tyr Thr Leu Leu Gly Phe Lys Thr Phe Leu Glu Val Arg Asp Gln
1175 1180 1185

Glu His Tyr Arg Glu Phe Pro Glu Ala Asn Phe Glu Leu Ser Ala
1190 1195 1200

Thr Asp Phe Leu Val Cys Phe Ile Pro Tyr Phe Gln Thr Leu Leu
1205 1210 1215

Phe Val Phe Val Leu Arg Tyr Met Glu Leu Lys Cys Gly Lys Lys
1220 1225 1230

Arg Met Arg Lys Asp Pro Val Phe Arg Ile Ser Pro Gln Ser Arg
1235 1240 1245

Asp Ala Lys Pro Asn Pro Glu Glu Pro Ile Asp Glu Asp Glu Asp
1250 1255 1260

Ile Gln Thr Glu Arg Ile Arg Thr Val Thr Ala Leu Thr Thr Ser
1265 1270 1275

Ile Leu Asp Glu Lys Pro Val Ile Ile Ala Ser Cys Leu His Lys
1280 1285 1290

Glu Tyr Ala Gly Gln Lys Lys Ser Cys Phe Ser Lys Arg Lys Lys
1295 1300 1305

Lys Ile Ala Ala Arg Asn Ile Ser Phe Cys Val Gln Glu Gly Glu
1310 1315 1320

Ile Leu Gly Leu Leu Gly Pro Ser Gly Ala Gly Lys Ser Ser Ser
1325 1330 1335

Ile Arg Met Ile Ser Gly Ile Thr Lys Pro Thr Ala Gly Glu Val
1340 1345 1350

Glu Leu Lys Gly Cys Ser Ser Val Leu Gly His Leu Gly Tyr Cys
1355 1360 1365

Pro Gln Glu Asn Val Leu Trp Pro Met Leu Thr Leu Arg Glu His
1370 1375 1380

Leu Glu Val Tyr Ala Ala Val Lys Gly Leu Arg Glu Ala Asp Ala
1385 1390 1395

Arg Leu Ala Ile Ala Arg Leu Val Ser Ala Phe Lys Leu His Glu
1400 1405 1410

Gln Leu Asn Val Pro Val Gln Lys Leu Thr Ala Gly Ile Thr Arg
1415 1420 1425

Lys Leu Cys Phe Val Leu Ser Leu Leu Gly Asn Ser Pro Val Leu
1430 1435 1440

Leu Leu Asp Glu Pro Ser Thr Gly Ile Asp Pro Thr Gly Gln Gln
1445 1450 1455

Gln Met Trp Gln Ala Ile Gln Ala Val Val Lys Asn Thr Glu Arg
1460 1465 1470

Gly Val Leu Leu Thr Thr His Asn Leu Ala Glu Ala Glu Ala Leu
1475 1480 1485

Cys Asp Arg Val Ala Ile Met Val Ser Gly Arg Leu Arg Cys Ile
1490 1495 1500

Gly Ser Ile Gln His Leu Lys Asn Lys Leu Gly Lys Asp Tyr Ile
1505 1510 1515

Leu Glu Leu Lys Val Lys Glu Thr Ser Gln Val Thr Leu Val His
1520 1525 1530

Thr Glu Ile Leu Lys Leu Phe Pro Gln Ala Ala Gly Gln Glu Arg
1535 1540 1545

Tyr Ser Ser Leu Leu Thr Tyr Lys Leu Pro Val Ala Asp Val Tyr
1550 1555 1560

Pro Leu Ser Gln Thr Phe His Lys Leu Glu Ala Val Lys His Asn
1565 1570 1575

Phe Asn Leu Glu Glu Tyr Ser Leu Ser Gln Cys Thr Leu Glu Lys
1580 1585 1590

Val Phe Leu Glu Leu Ser Lys Glu Gln Glu Val Gly Asn Phe Asp
1595 1600 1605

Glu Glu Ile Asp Thr Thr Met Arg Trp Lys Leu Leu Pro His Ser
1610 1615 1620

Asp Glu Pro



<210> 18 

<211> 611 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510082CD1 

<400> 18 

Met Pro Ala Pro Arg Ala Arg Glu Gln Pro Arg Val Pro Gly Glu
1 5 10 15

Arg Gln Pro Leu Leu Pro Arg Gly Ala Arg Gly Pro Arg Arg Trp
20 25 30

Arg Arg Ala Ala Gly Ala Ala Val Leu Leu Val Glu Met Leu Glu
35 40 45

Arg Ala Ala Phe Phe Gly Val Thr Ala Asn Leu Val Leu Tyr Leu
50 55 60

Asn Ser Thr Asn Phe Asn Trp Thr Gly Glu Gln Ala Thr Arg Ala
65 70 75

Ala Leu Val Phe Leu Gly Ala Ser Tyr Leu Leu Ala Pro Val Gly
80 85 90

Gly Trp Leu Ala Asp Val Tyr Leu Gly Arg Tyr Arg Ala Val Ala
95 100 105

Leu Ser Leu Leu Leu Tyr Leu Ala Ala Ser Gly Leu Leu Pro Ala
110 115 120

Thr Ala Phe Pro Asp Gly Arg Ser Ser Phe Cys Gly Glu Met Pro
125 130 135

Ala Ser Pro Leu Gly Pro Ala Cys Pro Ser Ala Gly Cys Pro Arg
140 145 150

Ser Ser Pro Ser Pro Tyr Cys Ala Pro Val Leu Tyr Ala Gly Leu
155 160 165

Leu Leu Leu Gly Leu Ala Ala Ser Ser Val Arg Ser Asn Leu Thr
170 175 180

Ser Phe Gly Ala Asp Gln Val Met Asp Leu Gly Arg Asp Ala Thr
185 190 195

Arg Arg Phe Phe Asn Trp Phe Tyr Trp Ser Ile Asn Leu Gly Ala
200 205 210

Val Leu Ser Leu Leu Val Val Ala Phe Ile Gln Gln Asn Ile Ser
215 220 225

Phe Leu Leu Gly Tyr Ser Ile Pro Val Gly Cys Val Gly Leu Ala
230 235 240

Phe Phe Ile Phe Leu Phe Ala Thr Pro Val Phe Ile Thr Lys Pro
245 250 255

Pro Met Gly Ser Gln Val Ser Ser Met Leu Lys Leu Ala Leu Gln
260 265 270

Asn Cys Cys Pro Gln Leu Trp Gln Arg His Ser Ala Arg Asp Arg
275 280 285

Gln Cys Ala Arg Val Leu Ala Asp Glu Arg Ser Pro Gln Pro Gly
290 295 300

Ala Ser Pro Gln Glu Asp Ile Ala Asn Phe Gln Val Leu Val Lys
305 310 315

Ile Leu Pro Val Met Val Thr Leu Val Pro Tyr Trp Met Val Tyr
320 325 330

Phe Gln Met Gln Ser Thr Tyr Val Leu Gln Gly Leu His Leu His
335 340 345

Ile Pro Asn Ile Phe Pro Ala Asn Pro Ala Asn Ile Ser Val Ala
350 355 360

Leu Arg Ala Gln Gly Ser Ser Tyr Thr Ile Pro Glu Ala Trp Leu
365 370 375

Leu Leu Ala Asn Val Val Val Val Leu Ile Leu Val Pro Leu Lys
380 385 390

Asp Arg Leu Ile Asp Pro Leu Leu Leu Arg Cys Lys Leu Leu Pro
395 400 405

Ser Ala Leu Gln Lys Met Ala Leu Gly Met Phe Phe Gly Phe Thr
410 415 420

Ser Val Ile Val Ala Gly Val Leu Glu Met Glu Arg Leu His Tyr
425 430 435

Ile His His Asn Glu Thr Val Ser Gln Gln Ile Gly Glu Val Leu
440 445 450

Tyr Asn Ala Ala Pro Leu Ser Ile Trp Trp Gln Ile Pro Gln Tyr
455 460 465

Leu Leu Ile Gly Ile Ser Glu Ile Phe Ala Ser Ile Pro Gly Leu
470 475 480

Glu Phe Ala Tyr Ser Glu Ala Pro Arg Ser Met Gln Gly Ala Ile
485 490 495

Met Gly Ile Phe Phe Cys Leu Ser Gly Val Gly Ser Leu Leu Gly
500 505 510

Ser Ser Leu Val Ala Leu Leu Ser Leu Pro Gly Gly Trp Leu His
515 520 525

Cys Pro Lys Asp Phe Gly Asn Ile Asn Asn Cys Arg Met Asp Leu
530 535 540

Tyr Phe Phe Leu Leu Ala Gly Ile Gln Ala Val Thr Ala Leu Leu
545 550 555

Phe Val Trp Ile Ala Gly Arg Tyr Glu Arg Ala Ser Gln Gly Pro
560 565 570

Ala Ser His Arg Pro Phe Gln His Gly Gln Gly Leu Asp Arg Pro
575 580 585

Tyr Pro Gly Pro Leu Val Tyr Ser Thr Gly Lys Asn Gly Ser Ser
590 595 600

Pro Ser Ser Gly Phe Leu Leu Gly Leu Phe Cys
605 610

<210> 19 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510367CD1

<400> 19

Met Thr Gly Gln Gly Gln Ser Ala Ser Gly Ser Ser Ala Trp Ser
1 5 10 15

Thr Val Phe Arg His Val Arg Tyr Glu Asn Leu Ile Ala Gly Val
20 25 30

Ser Gly Gly Val Leu Ser Asn Leu Ala Leu His Pro Leu Asp Leu
35 40 45

Val Lys Ile Arg Phe Ala Gly Thr Ile Leu
50 55

<210> 20 
<211> 287 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7510413CD1 

<400> 20 

Met Asp Met Ala Trp Gln Met Met Gln Leu Leu Leu Leu Ala Leu
1 5 10 15

Val Thr Ala Ala Gly Ser Ala Gln Pro Arg Ser Ala Arg Ala Arg
20 25 30

Thr Asp Leu Leu Asn Val Cys Met Asn Ala Lys His His Lys Thr
35 40 45

Gln Pro Ser Pro Glu Asp Glu Leu Tyr Gly Gln Val Gly Ala Pro
50 55 60

Gln Gly Pro Ser Pro Gly Ser Val Pro Leu Asp Asp Leu Pro Gly
65 70 75

Ala Glu Glu Pro Glu Tyr Gly Gly Asp Gly Cys Gly Gly Glu Arg
80 85 90

Leu Ser Pro Val Ser Ser Pro Pro Ser Ala Val Pro Gly Arg Arg
95 100 105

Met Pro Ala Ala Arg Pro Ala Pro Ala Arg Ser Cys Thr Arg Thr
110 115 120

Pro Pro Ala Cys Thr Thr Leu Thr Gly Ile Thr Val Val Arg Trp
125 130 135

Asn Pro Pro Ala Ser Ala Thr Leu Ser Arg Thr Ala Val Ser Glu
140 145 150

Cys Ser Pro Asn Leu Gly Pro Trp Ile Arg Gln Val Asn Gln Ser
155 160 165

Trp Arg Lys Glu Arg Ile Leu Asn Val Pro Leu Cys Lys Glu Asp
170 175 180

Cys Glu Arg Trp Trp Glu Asp Cys Arg Thr Ser Tyr Thr Cys Lys
185 190 195

Ser Asn Trp His Lys Gly Trp Asn Trp Thr Ser Gly Ile Asn Glu
200 205 210

Cys Pro Ala Gly Ala Leu Cys Ser Thr Phe Glu Ser Tyr Phe Pro
215 220 225

Thr Pro Ala Ala Leu Cys Glu Gly Leu Trp Ser His Ser Phe Lys
230 235 240

Val Ser Asn Tyr Ser Arg Gly Ser Gly Arg Cys Ile Gln Met Trp
245 250 255

Phe Asp Ser Ala Gln Gly Asn Pro Asn Glu Glu Val Ala Lys Phe
260 265 270

Tyr Ala Ala Ala Met Asn Ala Gly Ala Pro Ser Arg Gly Ile Ile
275 280 285

Asp Ser











<210> 21 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1721303CD1

<400> 21

Met Ala Ser Val Gly Glu Cys Pro Ala Pro Val Pro Val Lys Asp
1 5 10 15

Lys Lys Leu Leu Glu Val Lys Leu Gly Glu Leu Pro Ser Trp Ile
20 25 30

Leu Met Arg Asp Phe Ser Pro Ser Gly Ile Phe Gly Ala Phe Gln
35 40 45

Arg Glu His Glu Arg Leu Arg Lys Tyr His
50 55

<210> 22 
<211> 272 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7502007CD1

<400> 22

Met Gly Ser Gly His Cys Leu Arg Ser Thr Arg Gly Ser Lys Met
1 5 10 15

Val Ser Trp Ser Val Ile Ala Lys Ile Gln Glu Ile Leu Gln Arg
20 25 30

Lys Met Val Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val
35 40 45

Met Met Val Phe Gly Leu Gly Ser Val Ala His Met Val Leu Asn
50 55 60

Lys Lys Tyr Gly Ser Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe
65 70 75

Gly Val Thr Met Gly Val His Val Ala Gly Arg Ile Ser Gly Ala
80 85 90

His Met Asn Ala Ala Val Thr Phe Ala Asn Cys Ala Leu Gly Arg
95 100 105

Val Pro Trp Arg Lys Phe Pro Val Tyr Val Leu Gly Gln Phe Leu
110 115 120

Gly Ser Phe Leu Ala Ala Ala Thr Ile Tyr Ser Leu Phe Tyr Thr
125 130 135

Ala Ile Leu His Phe Ser Gly Gly Gln Leu Met Val Thr Gly Pro
140 145 150

Val Ala Thr Ala Gly Ile Phe Ala Thr Tyr Leu Pro Asp His Met
155 160 165

Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala Trp Leu Thr Gly Met
170 175 180

Leu Gln Leu Cys Leu Phe Ala Ile Thr Asp Gln Glu Asn Asn Pro
185 190 195

Ala Leu Pro Gly Thr Glu Ala Leu Val Ile Gly Ile Leu Val Val
200 205 210

Ile Ile Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala Ile Asn
215 220 225

Pro Ser Arg Asp Leu Pro Pro Arg Ile Phe Thr Phe Ile Ala Gly
230 235 240

Trp Gly Lys Gln Val Phe Arg Trp His His Leu Pro Gly Leu His
245 250 255

Trp Leu His His Pro Thr Gly Ala Pro Glu Ile Gly Gly Phe Cys
260 265 270

Gly Val



<210> 23 

<211> 188 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506439CD1 

<400> 23 

Met Leu Gly Lys Leu Ala Met Leu Leu Trp Val Gln Gln Ala Leu
1 5 10 15

Leu Ala Leu Leu Leu Pro Thr Leu Leu Ala Gln Gly Glu Ala Arg
20 25 30

Arg Ser Arg Asn Thr Thr Arg Pro Ala Leu Leu Arg Leu Ser Asp
35 40 45

Tyr Leu Leu Thr Asn Tyr Arg Lys Gly Val Arg Pro Val Arg Asp
50 55 60

Trp Arg Lys Pro Thr Thr Val Ser Ile Asp Val Ile Val Tyr Ala
65 70 75

Ile Leu Asn Val Asp Glu Lys Asn Gln Val Leu Thr Thr Tyr Ile
80 85 90

Trp Tyr Arg Gln Tyr Trp Thr Asp Glu Phe Leu Gln Trp Asn Pro
95 100 105

Glu Asp Phe Asp Asn Ile Thr Lys Leu Ser Ile Pro Thr Asp Ser
110 115 120

Ile Trp Val Pro Asp Ile Leu Ile Asn Glu Phe Val Asp Val Gly
125 130 135

Lys Ser Pro Asn Ile Pro Tyr Val Tyr Ile Arg His Gln His Leu
140 145 150

Phe Val Ala Leu Ala Arg Lys Gly Glu Ile Arg Gln Glu Cys Leu
155 160 165

His Glu Pro Gly Arg Val Gly Val Ala Gly Gly Ala Ala Leu Leu
170 175 180

Ser Gly Val Gln His Gly Lys Gln
185



<210> 24 




<211> 111 




<212> PRT 




<213> Homo sapiens 




<220> 




<221> misc_feature




<223> Incyte ID No: 7509243CD1 




<400> 24 




Met Pro Ser Ala Gly Leu Cys Ser Cys Trp Gly Gly Arg Val Leu
1 5 10 15

Pro Leu Leu Leu Ala Tyr Val Cys Tyr Leu Leu Leu Gly Ala Thr
20 25 30

Ile Phe Gln Leu Leu Glu Arg Gln Ala Glu Ala Gln Ser Arg Asp
35 40 45

Gln Phe Gln Leu Glu Lys Leu Arg Phe Leu Glu Asn Tyr Thr Cys
50 55 60

Leu Asp Gln Trp Ala Met Glu Gln Phe Val Gln Val Ile Met Glu
65 70 75

Ala Trp Val Lys Gly Val Asn Pro Lys Gly Asn Ser Thr Asn Pro
80 85 90

Ser Asn Trp Asp Phe Gly Ser Ser Phe Phe Phe Ala Gly Thr Val
95 100 105

Val Thr Thr Ile Gly His
110





<210> 25 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509404CD1 

<400> 25 

Met Lys Phe Leu Leu Thr Thr Ala Phe Leu Ile Leu Ile Ser Leu
1 5 10 15

Trp Val Glu Glu Ala Tyr Ser Lys Glu Lys Ser Ser Lys Lys Gly
20 25 30

Lys Gly Lys Lys Lys Gln Tyr Leu Cys Pro Ser Glu Arg Leu Tyr
35 40 45

His

<210> 26

<211> 204 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509439CD1 



<400> 26 



Met Phe Ser Arg Ala Gly Val Ala Gly Leu Ser Ala Trp Thr Leu
1 5 10 15

Gln Pro Gln Trp Ile Gln Val Arg Asn Met Ala Thr Leu Lys Asp
20 25 30

Ile Thr Arg Arg Leu Lys Ser Ile Lys Asn Ile Gln Lys Ile Thr
35 40 45

Lys Ser Met Lys Met Val Ala Ala Ala Lys Tyr Ala Arg Ala Glu
50 55 60

Arg Glu Leu Lys Pro Ala Arg Ile Tyr Gly Leu Gly Ser Leu Ala
65 70 75

Leu Tyr Glu Lys Ala Asp Ile Lys Gly Pro Glu Asp Lys Lys Lys
80 85 90

His Leu Leu Ile Gly Val Ser Ser Asp Arg Gly Leu Cys Gly Ala
95 100 105

Ile His Ser Ser Ile Ala Lys Gln Met Lys Ser Glu Val Ala Thr
110 115 120

Leu Thr Ala Ala Gly Lys Glu Val Met Leu Val Gly Ile Gly Asp
125 130 135

Lys Ile Arg Gly Ile Leu Tyr Ser Ser Leu Gln Val Leu Lys Glu
140 145 150

Arg Asn Asp Asp Ser Val Trp Asn Asn Ser Gly Asn His His His
155 160 165

Pro Tyr Pro Lys Asp Leu Ile His Gly Leu Ile Leu Thr Ser Phe
170 175 180

Trp Trp His Ser Lys Lys Trp Glu Glu Ser Pro Pro Leu Leu Glu
185 190 195

Met Arg Gln Ser Leu Pro Leu Asn Tyr
200



<210> 27 

<211> 1400 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510202CD1 



<400> 27 

Met Ser Lys Arg Arg Met Ser Val Gly Gln Gln Thr Trp Ala Leu
1 5 10 15

Leu Cys Lys Asn Cys Leu Lys Lys Trp Arg Met Lys Arg Gln Thr
20 25 30

Leu Leu Glu Trp Leu Phe Ser Phe Leu Leu Val Leu Phe Leu Tyr
35 40 45

Leu Phe Phe Ser Asn Leu His Gln Val His Asp Thr Pro Gln Met
50 55 60

Ser Ser Met Asp Leu Gly Arg Val Asp Ser Phe Asn Asp Thr Asn
65 70 75

Tyr Val Ile Ala Phe Ala Pro Glu Ser Lys Thr Thr Gln Glu Ile
80 85 90

Met Asn Lys Val Ala Ser Ala Pro Phe Leu Lys Gly Arg Thr Ile
95 100 105

Met Gly Trp Pro Asp Glu Lys Ser Met Asp Glu Leu Asp Leu Asn
110 115 120

Tyr Ser Ile Asp Ala Val Arg Val Ile Phe Thr Asp Thr Phe Ser
125 130 135

Tyr His Leu Lys Phe Ser Trp Gly His Arg Ile Pro Met Met Lys
140 145 150

Glu His Arg Asp His Ser Ala His Cys Gln Ala Val Asn Glu Lys
155 160 165

Met Lys Cys Glu Gly Ser Glu Phe Trp Glu Lys Gly Phe Val Ala
170 175 180

Phe Gln Ala Ala Ile Asn Ala Ala Ile Ile Glu Ile Ala Thr Asn
185 190 195

His Ser Val Met Glu Gln Leu Met Ser Val Thr Gly Val His Met
200 205 210

Lys Ile Leu Pro Phe Val Ala Gln Gly Gly Val Ala Thr Asp Phe
215 220 225

Phe Ile Phe Phe Cys Ile Ile Ser Phe Ser Thr Phe Ile Tyr Tyr
230 235 240

Val Ser Val Asn Val Thr Gln Glu Arg Gln Tyr Ile Thr Ser Leu
245 250 255

Met Thr Met Met Gly Leu Arg Glu Ser Ala Phe Trp Leu Ser Trp
260 265 270

Gly Leu Met Tyr Ala Gly Phe Ile Leu Ile Met Ala Thr Leu Met
275 280 285

Ala Leu Ile Val Lys Ser Ala Gln Ile Val Val Leu Thr Gly Phe
290 295 300

Val Met Val Phe Thr Leu Phe Leu Leu Tyr Gly Leu Ser Leu Ile
305 310 315

Thr Leu Ala Phe Leu Met Ser Val Leu Ile Lys Lys Pro Phe Leu
320 325 330

Thr Gly Leu Val Val Phe Leu Leu Ile Val Phe Trp Gly Ile Leu
335 340 345

Gly Phe Pro Ala Leu Tyr Thr His Leu Pro Ala Phe Leu Glu Trp
350 355 360

Thr Leu Cys Leu Leu Ser Pro Phe Ala Phe Thr Val Gly Met Ala
365 370 375

Gln Leu Ile His Leu Asp Tyr Asp Val Asn Ser Asn Ala His Leu
380 385 390

Asp Ser Ser Gln Asn Pro Tyr Leu Ile Ile Ala Thr Leu Phe Met
395 400 405

Leu Val Phe Asp Thr Leu Leu Tyr Leu Val Leu Thr Leu Tyr Phe
410 415 420

Asp Lys Ile Leu Pro Ala Glu Tyr Gly His Arg Cys Ser Pro Leu
425 430 435

Phe Phe Leu Lys Ser Cys Phe Trp Phe Gln His Gly Arg Ala Asn
440 445 450

His Val Val Leu Glu Asn Glu Thr Asp Ser Asp Pro Thr Pro Asn
455 460 465

Asp Cys Phe Glu Pro Val Ser Pro Glu Phe Cys Gly Lys Glu Ala
470 475 480

Ile Arg Ile Lys Asn Leu Lys Lys Glu Tyr Ala Gly Lys Cys Glu
485 490 495

Arg Val Glu Ala Leu Lys Gly Val Val Phe Asp Ile Tyr Glu Gly
500 505 510

Gln Ile Thr Ala Leu Leu Gly His Ser Gly Ala Gly Lys Thr Thr
515 520 525

Leu Leu Asn Ile Leu Ser Gly Leu Ser Val Pro Thr Ser Gly Ser
530 535 540

Val Thr Val Tyr Asn His Thr Leu Ser Arg Met Ala Asp Ile Glu
545 550 555

Asn Ile Ser Lys Phe Thr Gly Phe Cys Pro Gln Ser Asn Val Gln
560 565 570

Phe Gly Phe Leu Thr Val Lys Glu Asn Leu Arg Leu Phe Ala Lys
575 580 585

Ile Lys Gly Ile Leu Pro His Glu Val Glu Lys Glu Val Leu Leu
590 595 600

Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro Leu Ser Arg His Arg
605 610 615

Ile Trp Asn Leu Leu Lys Glu Gly Lys Ser Asp Arg Val Ile Leu
620 625 630

Phe Ser Thr Gln Phe Ile Asp Glu Ala Asp Ile Leu Ala Asp Arg
635 640 645

Lys Val Phe Ile Ser Asn Gly Lys Leu Lys Cys Ala Gly Ser Ser
650 655 660

Leu Phe Leu Lys Lys Lys Trp Gly Ile Gly Tyr His Leu Ser Leu
665 670 675

His Leu Asn Glu Arg Cys Asp Pro Glu Ser Ile Thr Ser Leu Val
680 685 690

Lys Gln His Ile Ser Asp Ala Lys Leu Thr Ala Gln Ser Glu Glu
695 700 705

Lys Leu Val Tyr Ile Leu Pro Leu Glu Arg Thr Asn Lys Phe Pro
710 715 720

Glu Leu Tyr Arg Asp Leu Asp Arg Cys Ser Asn Gln Gly Ile Glu
725 730 735

Asp Tyr Gly Val Ser Ile Thr Thr Leu Asn Glu Val Phe Leu Lys
740 745 750

Leu Glu Gly Lys Ser Thr Ile Asp Glu Ser Asp Ile Gly Ile Trp
755 760 765

Gly Gln Leu Gln Thr Asp Gly Ala Lys Asp Ile Gly Ser Leu Val
770 775 780

Glu Leu Glu Gln Val Leu Ser Ser Phe His Glu Thr Arg Lys Thr
785 790 795

Ile Ser Gly Val Ala Leu Trp Arg Gln Gln Val Cys Ala Ile Ala
800 805 810

Lys Val Arg Phe Leu Lys Leu Lys Lys Glu Arg Lys Ser Leu Trp
815 820 825

Thr Ile Leu Leu Leu Phe Gly Ile Ser Phe Ile Pro Gln Leu Leu
830 835 840

Glu His Leu Phe Tyr Glu Ser Tyr Gln Lys Ser Tyr Pro Trp Glu
845 850 855

Leu Ser Pro Asn Thr Tyr Phe Leu Ser Pro Gly Gln Gln Pro Gln
860 865 870

Asp Pro Leu Thr His Leu Leu Val Ile Asn Lys Thr Gly Ser Thr
875 880 885

Ile Asp Asn Phe Leu His Ser Leu Arg Arg Gln Asn Ile Ala Ile
890 895 900

Glu Val Asp Ala Phe Gly Thr Arg Asn Gly Thr Asp Asp Pro Ser
905 910 915

Tyr Asn Gly Ala Ile Ile Val Ser Gly Asp Glu Lys Asp His Arg
920 925 930

Phe Ser Ile Ala Cys Asn Thr Lys Arg Leu Asn Cys Phe Pro Val
935 940 945

Leu Leu Asp Val Ile Ser Asn Gly Leu Leu Gly Ile Phe Asn Ser
950 955 960

Ser Glu His Ile Gln Thr Asp Arg Ser Thr Phe Phe Glu Glu His
965 970 975

Met Asp Tyr Glu Tyr Gly Tyr Arg Ser Asn Thr Phe Phe Trp Ile
980 985 990

Pro Met Ala Ala Ser Phe Thr Pro Tyr Ile Ala Met Ser Ser Ile
995 1000 1005

Gly Asp Tyr Lys Lys Lys Ala His Ser Gln Leu Arg Ile Ser Gly
1010 1015 1020

Leu Tyr Pro Ser Ala Tyr Trp Phe Gly Gln Ala Leu Val Asp Val
1025 1030 1035

Ser Leu Tyr Phe Leu Ile Leu Leu Leu Met Gln Ile Met Asp Tyr
1040 1045 1050

Ile Phe Ser Pro Glu Glu Ile Ile Phe Ile Ile Gln Asn Leu Leu
1055 1060 1065

Ile Gln Ile Leu Cys Ser Ile Gly Tyr Val Ser Ser Pro Val Phe
1070 1075 1080

Leu Thr Tyr Val Ile Ser Phe Ile Phe Arg Asn Gly Arg Lys Asn
1085 1090 1095

Ser Gly Ile Trp Ser Phe Phe Phe Leu Ile Val Val Ile Phe Ser
1100 1105 1110

Ile Val Ala Thr Asp Leu Asn Glu Tyr Gly Phe Leu Gly Leu Phe
1115 1120 1125

Phe Gly Thr Met Leu Ile Pro Pro Phe Thr Leu Ile Gly Ser Leu
1130 1135 1140

Phe Ile Phe Ser Glu Ile Ser Pro Asp Ser Met Asp Tyr Leu Gly
1145 1150 1155

Ala Ser Glu Ser Glu Ile Val Tyr Leu Ala Leu Leu Ile Pro Tyr
1160 1165 1170

Leu His Phe Leu Ile Phe Leu Phe Ile Leu Arg Cys Leu Glu Met
1175 1180 1185

Asn Cys Arg Lys Lys Leu Met Arg Lys Asp Pro Val Phe Arg Ile
1190 1195 1200

Ser Pro Arg Ser Asn Ala Ile Phe Pro Asn Pro Glu Glu Pro Glu
1205 1210 1215

Gly Glu Glu Glu Asp Ile Gln Met Glu Arg Met Arg Thr Val Asn
1220 1225 1230

Ala Met Ala Val Arg Asp Phe Asp Glu Thr Pro Val Ile Ile Ala
1235 1240 1245

Ser Cys Leu Arg Lys Glu Tyr Ala Gly Lys Lys Lys Asn Cys Phe
1250 1255 1260

Ser Lys Arg Lys Lys Thr Ile Ala Thr Arg Asn Val Ser Phe Cys
1265 1270 1275

Val Lys Lys Gly Glu Val Ile Gly Leu Leu Gly His Asn Gly Ala
1280 1285 1290

Gly Lys Ser Thr Thr Ile Lys Met Ile Thr Gly Asp Thr Lys Pro
1295 1300 1305

Thr Ala Gly Gln Val Ile Leu Lys Gly Ser Gly Gly Gly Glu Pro
1310 1315 1320

Leu Gly Phe Leu Gly Tyr Cys Pro Gln Glu Asn Ala Leu Trp Pro
1325 1330 1335

Asn Leu Thr Val Arg Gln His Leu Glu Val Tyr Ala Ala Val Lys
1340 1345 1350

Gly Leu Arg Lys Gly Asp Ala Met Ile Ala Ile Thr Arg Leu Val
1355 1360 1365

Asp Ala Leu Lys Leu Gln Asp Gln Leu Lys Ala Pro Val Lys Thr
1370 1375 1380

Leu Ser Glu Gly Ile Lys Arg Lys Val Arg Ala Gly Leu Val Val
1385 1390 1395

Ala Leu Gln Val Pro
1400

<210> 28 
<211> 438 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7510203CD1 

<400> 28 

Met Gln Ala Thr Arg Asn Ala Ala Asp Trp Trp Leu Ser His Trp
1 5 10 15

Ile Ser Gln Leu Lys Ala Glu Asn Ser Ser Gln Glu Ala Gln Pro
20 25 30

Ser Thr Ser Pro Ala Ser Met Gly Leu Phe Ser Pro Gln Leu Leu
35 40 45

Leu Phe Ser Pro Gly Asn Leu Tyr Ile Pro Val Phe Pro Leu Pro
50 55 60

Lys Ala Ala Pro Asn Gly Ser Ser Asp Ile Arg Phe Tyr Leu Thr
65 70 75

Val Tyr Ala Thr Ile Ala Gly Val Asn Ser Leu Cys Thr Leu Leu
80 85 90

Arg Ala Val Leu Phe Ala Ala Gly Thr Leu Gln Ala Ala Ala Thr
95 100 105

Leu His Arg Arg Leu Leu His Arg Val Leu Met Ala Pro Val Thr
110 115 120

Phe Phe Asn Ala Thr Pro Thr Gly Arg Ile Leu Asn Arg Phe Ser
125 130 135

Ser Asp Val Ala Cys Ala Asp Asp Ser Leu Pro Phe Ile Leu Asn
140 145 150

Ile Leu Leu Ala Asn Ala Ala Gly Leu Leu Gly Leu Leu Ala Val
155 160 165

Leu Gly Ser Gly Leu Pro Trp Leu Leu Leu Leu Leu Pro Pro Leu
170 175 180

Ser Ile Met Tyr Tyr His Val Gln Arg His Tyr Arg Ala Ser Ser
185 190 195

Arg Glu Leu Arg Arg Leu Gly Ser Leu Thr Leu Ser Pro Leu Tyr
200 205 210

Ser His Leu Ala Asp Thr Leu Ala Gly Leu Ser Val Leu Arg Ala
215 220 225

Thr Gly Ala Thr Tyr Arg Phe Glu Glu Glu Asn Leu Arg Leu Leu
230 235 240

Glu Leu Asn Gln Arg Cys Gln Phe Ala Thr Ser Ala Thr Met Gln
245 250 255

Trp Leu Asp Ile Arg Leu Gln Leu Met Gly Ala Ala Val Val Ser
260 265 270

Ala Ile Ala Gly Ile Ala Leu Val Gln His Gln Gln Gly Leu Ala
275 280 285

Asn Pro Gly Leu Val Gly Leu Ser Leu Ser Tyr Ala Leu Ser Leu
290 295 300

Thr Gly Leu Leu Ser Gly Leu Val Ser Ser Phe Thr Gln Thr Glu
305 310 315

Ala Met Leu Val Ser Val Glu Arg Leu Glu Glu Tyr Thr Cys Asp
320 325 330

Leu Pro Gln Glu Pro Gln Gly Gln Pro Leu Gln Val Gly Leu Tyr
335 340 345

Pro His Pro Arg Pro Lys Leu Trp Asn Pro Glu Gly Pro Ser Leu
350 355 360

Pro His Asn Ser Phe Leu Phe Ala His Pro Ser Phe Ser Ala Pro
365 370 375

Ile Thr Ser Leu His Asp Asp His Asn Ser Ser Pro Cys Pro Phe
380 385 390

Phe Pro Ile Ser His Ser Leu Ile Pro Leu Thr Leu Ser Ile Ser
395 400 405

His Tyr Ser Pro Leu Leu Thr Ile Ala Pro His Leu Pro Tyr Leu
410 415 420

Pro Phe Pro Val Cys Leu Pro Pro Met Asp Pro Thr Ser Trp Ala
425 430 435

Pro Ala Gly



<210> 29 
<211> 871 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7510208CD1 

<400> 29 

Met Gly Phe Leu His Gln Leu Gln Leu Leu Leu Trp Lys Asn Val
1 5 10 15

Thr Leu Lys Arg Arg Ser Pro Trp Val Leu Ala Phe Glu Ile Phe
20 25 30

Ile Pro Leu Val Leu Phe Phe Ile Leu Leu Gly Leu Arg Gln Lys
35 40 45

Lys Pro Thr Ile Ser Val Lys Glu Val Ser Phe Tyr Thr Ala Ala
50 55 60

Pro Leu Thr Ser Ala Gly Ile Leu Pro Val Met Gln Ser Leu Cys
65 70 75

Pro Asp Gly Gln Arg Asp Glu Phe Gly Phe Leu Gln Tyr Ala Asn
80 85 90

Ser Thr Val Thr Gln Leu Leu Glu Arg Leu Asp Arg Val Val Glu
95 100 105

Glu Gly Asn Leu Phe Asp Pro Ala Arg Pro Ser Leu Gly Ser Glu
110 115 120

Leu Glu Ala Leu Arg Gln His Leu Glu Ala Leu Ser Ala Gly Pro
125 130 135

Gly Thr Ser Gly Ser His Leu Asp Arg Ser Thr Val Ser Ser Phe
140 145 150

Ser Leu Asp Ser Val Ala Arg Asn Pro Gln Glu Leu Trp Arg Phe
155 160 165

Leu Thr Gln Asn Leu Ser Leu Pro Asn Ser Thr Ala Gln Ala Leu
170 175 180

Leu Ala Ala Arg Val Asp Pro Pro Glu Val Tyr His Leu Leu Phe
185 190 195

Gly Pro Ser Ser Ala Leu Asp Ser Gln Ser Gly Leu His Lys Gly
200 205 210

Gln Glu Pro Trp Ser Arg Leu Gly Gly Asn Pro Leu Phe Arg Met
215 220 225

Glu Glu Leu Leu Leu Ala Pro Ala Leu Leu Glu Gln Leu Thr Cys
230 235 240

Thr Pro Gly Ser Gly Glu Leu Gly Arg Ile Leu Thr Val Pro Glu
245 250 255

Ser Gln Lys Gly Ala Leu Gln Gly Tyr Arg Asp Ala Val Cys Ser
260 265 270

Gly Gln Ala Ala Ala Arg Ala Arg Arg Phe Ser Gly Leu Ser Ala
275 280 285

Glu Leu Arg Asn Gln Leu Asp Val Ala Lys Val Ser Gln Gln Leu
290 295 300

Gly Leu Asp Ala Pro Asn Gly Ser Asp Ser Ser Pro Gln Ala Pro
305 310 315

Pro Pro Arg Arg Leu Gln Ala Leu Leu Gly Asp Leu Leu Asp Ala
320 325 330

Gln Lys Val Leu Gln Asp Val Asp Val Leu Ser Ala Leu Ala Leu
335 340 345

Leu Leu Pro Gln Gly Ala Cys Thr Gly Arg Thr Pro Gly Pro Pro
350 355 360

Ala Ser Gly Ala Gly Gly Ala Ala Asn Gly Thr Gly Ala Gly Ala
365 370 375

Val Met Gly Pro Asn Ala Thr Ala Glu Glu Gly Ala Pro Ser Ala
380 385 390

Ala Ala Leu Ala Thr Pro Asp Thr Leu Gln Gly Gln Cys Ser Ala
395 400 405

Phe Val Gln Leu Trp Ala Gly Leu Gln Pro Ile Leu Cys Gly Asn
410 415 420

Asn Arg Thr Ile Glu Pro Glu Ala Leu Arg Arg Gly Asn Met Ser
425 430 435

Ser Leu Gly Phe Thr Ser Lys Glu Gln Arg Asn Leu Gly Leu Leu
440 445 450

Val His Leu Met Thr Ser Asn Pro Lys Ile Leu Tyr Ala Pro Ala
455 460 465

Gly Ser Glu Val Asp Arg Val Ile Leu Lys Ala Asn Glu Thr Phe
470 475 480

Ala Phe Val Gly Asn Val Thr His Tyr Ala Gln Val Trp Leu Asn
485 490 495

Ile Ser Ala Glu Ile Arg Ser Phe Leu Glu Gln Gly Arg Leu Gln
500 505 510

Gln His Leu Arg Trp Leu Gln Gln Tyr Val Ala Glu Leu Arg Leu
515 520 525

His Pro Glu Ala Leu Asn Leu Ser Leu Asp Glu Leu Pro Pro Ala
530 535 540

Leu Arg Gln Asp Asn Phe Ser Leu Pro Ser Gly Met Ala Leu Leu
545 550 555

Gln Gln Leu Asp Thr Ile Asp Asn Ala Ala Cys Gly Trp Ile Gln
560 565 570

Phe Met Ser Lys Val Ser Val Asp Ile Phe Lys Gly Phe Pro Asp
575 580 585

Glu Glu Ser Ile Val Asn Tyr Thr Leu Asn Gln Ala Tyr Gln Asp
590 595 600

Asn Val Thr Val Phe Ala Ser Val Ile Phe Gln Thr Arg Lys Asp
605 610 615

Gly Ser Leu Pro Pro His Val His Tyr Lys Ile Arg Gln Asn Ser
620 625 630

Ser Phe Thr Glu Lys Thr Asn Glu Ile Arg Arg Ala Tyr Trp Arg
635 640 645

Pro Gly Pro Asn Thr Gly Gly Arg Phe Tyr Phe Leu Tyr Gly Phe
650 655 660

Val Trp Ile Gln Asp Met Met Glu Arg Ala Ile Ile Asp Thr Phe
665 670 675

Val Gly His Asp Val Val Glu Pro Gly Ser Tyr Val Gln Met Phe
680 685 690

Pro Tyr Pro Cys Tyr Thr Arg Asp Asp Phe Leu Phe Val Ile Glu
695 700 705

His Met Met Pro Leu Cys Met Val Ile Ser Trp Val Tyr Ser Val
710 715 720

Ala Met Thr Ile Gln His Ile Val Ala Glu Lys Glu His Arg Leu
725 730 735

Lys Glu Val Arg Gly Pro Gly Leu Ser Leu Glu Ala Arg Ala Gly
740 745 750

Arg Glu Gly Arg Arg Pro Pro Arg Gly Leu Pro Gln Ala Pro Gly
755 760 765

Pro Pro Ala Gly Asp Glu Asp His Gly Pro Glu Gln Arg Gly Ala
770 775 780

Leu Gly Gly Leu Val His His Arg Leu Cys Ala Ala Val His Leu
785 790 795

Arg Asp Ser Thr His Arg His Pro Glu Val Arg Pro Gly Ala Tyr
800 805 810

Ala Gln Pro Arg Gly His His Leu Ala Leu Pro Gly Ser Leu Arg
815 820 825

Gly Gly His His His Val Leu Leu Pro Gly Val Cys Ala Val Leu
830 835 840

Gln Gly Gln Ala Gly Leu Gly Leu Arg Trp His His Leu Leu Pro
845 850 855

Glu Leu Arg Ala Leu His Val Arg Gly Asp Pro Arg Gly Gly Gly
860 865 870

Ala



<210> 30 
<211> 104 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7510446CD1 

<400> 30 

Met Glu Gln Ser Arg Ser Gln Gln Arg Gly Gly Glu Gln Ser Trp
1 5 10 15

Trp Gly Ser Asp Pro Gln Tyr Gln Tyr Met Pro Phe Glu His Cys
20 25 30

Thr Ser Tyr Gly Leu Pro Ser Glu Asn Gly Gly Leu Gln His Arg
35 40 45

Leu Arg Lys Asp Ala Gly Pro Arg His Asn Val His Pro Thr Gln
50 55 60

Ile Tyr Gly His His Lys Glu Gln Phe Ser Asp Arg Glu Gln Asp
65 70 75

Ile Gly Met Pro Lys Lys Thr Gly Ser Ser Ser Thr Val Asp Ser
80 85 90

Lys Asp Glu Asp His Tyr Ser Lys Cys Gln Gly Asp Gly Asp
95 100

<210> 31 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505294CD1 

<400> 31 

Met Ala Ser Asp Pro Ile Phe Thr Leu Ala Pro Pro Leu His Cys
1 5 10 15

His Tyr Gly Ala Phe Pro Pro Asn Ala Ser Gly Trp Glu Gln Pro
20 25 30

Pro Asn Ala Ser Gly Val Ser Val Ala Ser Ala Ala Leu Ala Ala
35 40 45

Ser Ala Ala Ser Arg Val Ala Thr Ser Thr Asp Pro Ser Cys Ser
50 55 60

Gly Phe Ala Pro Pro Asp Phe Asn His Cys Leu Lys Asp Trp Asp
65 70 75

Tyr Asn Gly Leu Pro Val Leu Thr Thr Asn Ala Ile Gly Gln Trp
80 85 90

Asp Leu Val Cys Asp Leu Gly Trp Gln Val Ile Leu Glu Gln Ile
95 100 105

Leu Phe Ile Leu Gly Phe Ala Ser Gly Tyr Leu Phe Leu Gly Tyr
110 115 120

Pro Ala Asp Arg Phe Gly Arg Arg Gly Ile Val Leu Leu Thr Leu
125 130 135

Gly Leu Val Gly Pro Cys Gly Val Gly Gly Ala Ala Ala Gly Ser
140 145 150

Ser Thr Gly Val Met Ala Leu Arg Phe Leu Leu Gly Phe Leu Leu
155 160 165

Ala Gly Val Asp Leu Gly Val Tyr Leu Met Arg Leu Glu Leu Cys
170 175 180

Asp Pro Thr Gln Arg Leu Arg Val Ala Leu Ala Gly Glu Leu Val
185 190 195

Gly Val Gly Gly His Phe Leu Phe Leu Gly Leu Ala Leu Val Ser
200 205 210

Lys Asp Trp Arg Phe Leu Gln Arg Met Ile Thr Ala Pro Cys Ile
215 220 225

Leu Phe Leu Phe Tyr Gly Trp Pro Gly Leu Phe Leu Glu Ser Ala
230 235 240

Arg Trp Leu Ile Val Lys Arg Gln Ile Glu Glu Ala Gln Ser Val
245 250 255

Leu Arg Ile Leu Ala Glu Arg Asn Arg Pro His Gly Gln Met Leu
260 265 270

Gly Glu Glu Ala Gln Glu Ala Leu Gln Ala Ser Leu Pro Met Pro
275 280 285

Phe Ala Thr Ala Thr Ser Leu Trp Glu Glu Glu Gly Ala His Arg
290 295 300

Thr Ser Thr Cys Ala Leu Cys Trp Pro Ala Ala Pro Gln Pro Trp
305 310 315

Pro Val Ser Ser Trp Gly Ser Pro Trp Thr Asp Leu Ala Ala Gly
320 325 330

Ala Ser Phe Phe Ser Pro
335

<210> 32 
<211> 271 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7505631CD1 

<400> 32 

Met Asp Asp Phe Ile Ser Ile Ser Leu Leu Ser Leu Ala Met Leu
1 5 10 15

Val Gly Cys Tyr Val Ala Gly Ile Ile Pro Leu Ala Val Asn Phe
20 25 30

Ser Glu Glu Arg Leu Lys Leu Val Thr Val Leu Gly Ala Gly Leu
35 40 45

Leu Cys Gly Thr Ala Leu Ala Val Ile Val Pro Glu Gly Val His
50 55 60

Ala Leu Tyr Glu Asp Ile Leu Glu Gly Lys His His Gln Ala Ser
65 70 75

Glu Thr His Asn Val Ile Ala Ser Asp Lys Ala Ala Glu Lys Ser
80 85 90

Val Val His Glu His Glu His Ser His Asp His Thr Gln Leu His
95 100 105

Ala Tyr Ile Gly Val Ser Leu Val Leu Gly Phe Val Phe Met Leu
110 115 120

Leu Val Asp Gln Ile Gly Asn Ser His Val His Ser Thr Asp Asp
125 130 135

Pro Glu Ala Ala Arg Ser Ser Asn Ser Lys Ile Thr Thr Thr Leu
140 145 150

Gly Leu Val Val His Ala Ala Ala Asp Gly Val Ala Leu Gly Ala
155 160 165

Ala Ala Ser Thr Ser Gln Thr Ser Val Gln Leu Ile Val Phe Val
170 175 180

Ala Ile Met Leu His Lys Ala Pro Ala Ala Phe Gly Leu Val Ser
185 190 195

Phe Leu Met His Ala Gly Leu Glu Arg Asn Arg Ile Arg Lys His
200 205 210

Leu Leu Val Phe Ala Leu Ala Ala Pro Val Met Ser Met Val Thr
215 220 225

Tyr Leu Gly Leu Ser Lys Ser Ser Lys Glu Ala Leu Ser Glu Val
230 235 240

Asn Ala Thr Gly Val Ala Met Leu Phe Ser Ala Gly Thr Phe Leu
245 250 255

Tyr Val Ala Thr Val Arg Lys Val Ala Gln Ile Gly Tyr Ser Cys
260 265 270

Met



<210> 33 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506561CD1 

<400> 33 

Met Ala Trp Gln Met Met Gln Leu Leu Leu Leu Ala Leu Val Thr
1 5 10 15

Ala Ala Gly Ser Ala Gln Pro Arg Ser Ala Arg Ala Arg Thr Asp
20 25 30

Leu Leu Asn Val Cys Met Asn Ala Lys His His Lys Thr Gln Pro
35 40 45

Ser Pro Glu Asp Glu Leu Tyr Gly Gln Cys Ser Pro Trp Lys Lys
50 55 60

Asn Ala Cys Cys Thr Ala Ser Thr Ser Gln Glu Leu His Lys Asp
65 70 75

Thr Ser Arg Leu Tyr Asn Phe Asn Trp Asp His Cys Gly Gln Pro
80 85 90

Glu Leu Ala Gln Arg Ala His Ser Glu Arg Ala Pro Val Gln Arg
95 100 105

Gly Leu



<210> 34 

<211> 249 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510733CD1 

<400> 34 

Met Gln Pro Glu Gly Ala Glu Lys Gly Lys Ser Phe Lys Gln Arg
1 5 10 15

Leu Val Leu Lys Ser Ser Leu Ala Lys Glu Thr Leu Ser Glu Phe
20 25 30

Leu Gly Thr Phe Ile Leu Ile Val Leu Gly Cys Gly Cys Val Ala
35 40 45

Gln Ala Ile Leu Ser Arg Gly Arg Phe Gly Gly Val Ile Thr Ile
50 55 60

Asn Val Gly Phe Ser Met Ala Val Ala Met Ala Ile Tyr Val Ala
65 70 75

Gly Gly Val Ser Asp Gly Leu Met Ser Phe Ala Gly Gly Lys Leu
80 85 90

Leu Ile Val Gly Glu Asn Ala Thr Ala His Ile Phe Ala Thr Tyr
95 100 105

Pro Ala Pro Tyr Leu Ser Leu Ala Asn Ala Phe Ala Asp Gln Val
110 115 120

Val Ala Thr Met Ile Leu Leu Ile Ile Val Phe Ala Ile Phe Asp
125 130 135

Ser Arg Asn Leu Gly Ala Pro Arg Gly Leu Glu Pro Ile Ala Ile
140 145 150

Gly Leu Leu Ile Ile Val Ile Ala Ser Ser Leu Gly Leu Asn Ser
155 160 165

Gly Cys Ala Met Asn Pro Ala Arg Asp Leu Ser Pro Arg Leu Phe
170 175 180

Thr Ala Leu Ala Gly Trp Gly Phe Glu Val Phe Arg Ala Gly Asn
185 190 195

Asn Phe Trp Trp Ile Pro Val Val Gly Pro Leu Val Gly Ala Val
200 205 210

Ile Gly Gly Leu Ile Tyr Val Leu Val Ile Glu Ile His His Pro
215 220 225

Glu Pro Asp Ser Val Phe Lys Ala Glu Gln Ser Glu Asp Lys Pro
230 235 240

Glu Lys Tyr Glu Leu Ser Val Ile Met
245

<210> 35 
<211> 216 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7510734CD1 

<400> 35 

Met Gln Pro Glu Gly Ala Glu Lys Gly Lys Ser Phe Lys Gln Arg
1 5 10 15

Leu Val Leu Lys Ser Ser Leu Ala Lys Glu Thr Leu Ser Glu Phe
20 25 30

Leu Gly Thr Phe Ile Leu Ile Val Leu Gly Cys Gly Cys Val Ala
35 40 45

Gln Ala Ile Leu Ser Arg Gly Arg Phe Gly Gly Val Ile Thr Ile
50 55 60

Asn Val Gly Phe Ser Met Ala Val Ala Met Ala Ile Tyr Val Ala
65 70 75

Gly Gly Val Ser Gly Gly His Ile Asn Pro Ala Val Ser Leu Ala
80 85 90

Met Cys Leu Phe Gly Arg Met Lys Trp Phe Lys Leu Pro Phe Tyr
95 100 105

Val Gly Ala Gln Phe Leu Gly Ala Phe Val Gly Ala Ala Thr Val
110 115 120

Phe Gly Ile Tyr Tyr Asp Gly Leu Met Ser Phe Ala Gly Gly Lys
125 130 135

Leu Leu Ile Val Gly Glu Asn Ala Thr Ala His Ile Phe Ala Thr
140 145 150

Tyr Pro Ala Pro Tyr Leu Ser Leu Ala Asn Ala Phe Ala Asp Gln
155 160 165

Lys Leu Gly Ser Pro Gln Arg Pro Arg Ala His Cys His Arg Pro
170 175 180

Pro Asp Tyr Cys His Cys Phe Leu Pro Gly Thr Glu Gln Trp Leu
185 190 195

Cys His Glu Pro Ser Ser Arg Pro Glu Ser Gln Thr Phe His Cys
200 205 210

Leu Gly Arg Leu Gly Val



<210> 36 

<211> 223 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7503977CD1 

<400> 36 

Met Ala Ser Thr Gly Gly Thr Lys Val Val Ala Met Gly Val Ala
1 5 10 15

Pro Trp Gly Val Val Arg Asn Arg Asp Thr Leu Ile Asn Pro Lys
20 25 30

Gly Ser Phe Pro Ala Arg Tyr Arg Trp Arg Gly Asp Pro Glu Asp
35 40 45

Gly Val Gln Phe Pro Leu Asp Tyr Asn Tyr Ser Ala Phe Phe Leu
50 55 60

Val Asp Asp Gly Thr His Gly Cys Leu Gly Gly Glu Asn Arg Phe
65 70 75

Arg Leu Arg Leu Glu Ser Tyr Ile Ser Gln Gln Lys Thr Gly Val
80 85 90

Gly Gly Thr Gly Ile Asp Ile Pro Val Leu Leu Leu Leu Ile Asp
95 100 105

Gly Asp Glu Lys Met Leu Thr Arg Ile Glu Asn Ala Thr Gln Ala
110 115 120

Gln Leu Pro Cys Leu Leu Val Ala Gly Ser Gly Gly Ala Ala Asp
125 130 135

Cys Leu Ala Glu Thr Leu Glu Asp Thr Leu Ala Pro Gly Ser Gly
140 145 150

Gly Ala Arg Gln Gly Glu Ala Arg Asp Arg Ile Arg Arg Phe Phe
155 160 165

Pro Lys Gly Asp Leu Glu Val Leu Gln Ala Gln Val Glu Arg Ile
170 175 180

Met Thr Arg Lys Glu Leu Leu Thr Val Tyr Ser Ser Glu Asp Gly
185 190 195

Ser Glu Glu Phe Glu Thr Ile Val Leu Lys Ala Leu Val Lys Val
200 205 210

Leu Pro Ser Arg Ser Phe Pro His Gly Arg Pro Ala Glu
215 220



<210> 37 
<211> 394 
<212> PRT 



<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 7505084CD1 

<400> 37 

Met Glu Ser Gly Thr Ser Ser Pro Gln Pro Pro Gln Leu Asp Pro
1 5 10 15

Leu Asp Ala Phe Pro Gln Lys Gly Leu Glu Pro Gly Asp Ile Ala
20 25 30

Val Leu Val Leu Tyr Phe Leu Phe Val Leu Ala Val Gly Leu Trp
35 40 45

Ser Thr Val Lys Thr Lys Arg Asp Thr Val Lys Gly Tyr Phe Leu
50 55 60

Ala Gly Gly Asp Met Val Trp Trp Pro Val Gly Ala Ser Leu Phe
65 70 75

Ala Ser Asn Val Gly Ser Gly His Phe Ile Gly Leu Ala Gly Ser
80 85 90

Gly Ala Ala Thr Gly Ile Ser Val Ser Ala Tyr Glu Leu Asn Gly
95 100 105

Leu Phe Ser Val Leu Met Leu Ala Trp Ile Phe Leu Pro Ile Tyr
110 115 120

Ile Ala Gly Gln Val Thr Thr Met Pro Glu Tyr Leu Arg Lys Arg
125 130 135

Phe Gly Gly Ile Arg Ile Pro Ile Ile Leu Ala Val Leu Tyr Leu
140 145 150

Phe Ile Tyr Ile Phe Thr Lys Ile Ser Val Asp Met Tyr Ala Gly
155 160 165

Ala Ile Phe Ile Gln Gln Ser Leu His Leu Asp Leu Tyr Leu Ala
170 175 180

Ile Val Gly Leu Leu Ala Ile Thr Ala Val Tyr Thr Val Ala Gly
185 190 195

Gly Leu Ala Ala Val Ile Tyr Thr Asp Ala Leu Gln Thr Leu Ile
200 205 210

Met Leu Ile Gly Ala Leu Thr Leu Met Gly Tyr Ser Phe Ala Ala
215 220 225

Val Gly Gly Met Glu Gly Leu Lys Glu Lys Tyr Phe Leu Ala Leu
230 235 240

Ala Ser Asn Arg Ser Glu Asn Ser Ser Cys Gly Leu Pro Arg Glu
245 250 255

Asp Ala Phe His Ile Phe Arg Asp Pro Leu Thr Ser Asp Leu Pro
260 265 270

Trp Pro Gly Val Leu Phe Gly Met Ser Ile Pro Ser Leu Trp Tyr
275 280 285

Trp Cys Thr Asp Gln Val Ile Val Gln Arg Thr Leu Ala Ala Lys
290 295 300

Asn Leu Ser His Ala Lys Gly Gly Ala Leu Met Ala Ala Tyr Leu
305 310 315

Lys Val Leu Pro Leu Phe Ile Met Val Phe Pro Gly Met Val Ser
320 325 330

Arg Ile Leu Phe Pro Asp Gln Val Ala Cys Ala Asp Pro Glu Ile
335 340 345

Cys Gln Lys Ile Cys Ser Asn Pro Ser Gly Cys Ser Asp Ile Ala
350 355 360

Tyr Pro Lys Leu Val Leu Glu Leu Leu Pro Thr Val Pro Ala Pro
365 370 375

Ser Ser Pro Trp Thr Ser Gly Ile Thr Ser Gly Leu Gly His Leu
380 385 390

Arg Arg Ser Ser



<210> 38 

<211> 202 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506950CD1 

<400> 38 

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Asp Ser Ile Thr Glu Val Phe Thr Asn Ile Tyr Val Thr
65 70 75

Ser Phe Gly Pro Val Ser Asp Thr Asp Met Glu Tyr Thr Ile Asp
80 85 90

Val Phe Phe Arg Gln Lys Trp Lys Asp Glu Arg Leu Lys Phe Lys
95 100 105

Gly Pro Met Asn Ile Leu Arg Leu Asn Asn Leu Met Ala Ser Lys
110 115 120

Ile Trp Thr Pro Asp Thr Phe Phe His Asn Gly Lys Lys Ser Val
125 130 135

Ala His Asn Met Thr Met Pro Asn Lys Leu Leu Arg Ile Gln Asp
140 145 150

Asp Gly Thr Leu Leu Tyr Thr Met Arg Ser Asn Asn Cys Pro Asn
155 160 165

Asn Asp Asn Ser Lys His Gln Cys Ser Glu Phe Ser Pro Gln Ser
170 175 180

Gly Leu Cys Asn Cys His Gly Leu Val Tyr Cys Cys Leu Leu Cys
185 190 195

Ile Cys Val Leu Cys Pro Asn
200

<210> 39 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506951CD1 

<400> 39 

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Asp Ser Ile Thr Glu Val Phe Thr Asn Ile Tyr Val Thr
65 70 75

Ser Phe Gly Pro Val Ser Asp Thr Asp Met Glu Tyr Thr Ile Asp
80 85 90

Val Phe Phe Arg Gln Lys Trp Lys Asp Glu Arg Leu Lys Phe Lys
95 100 105

Gly Pro Met Asn Ile Leu Arg Leu Asn Asn Leu Met Ala Ser Lys
110 115 120

Ile Trp Thr Pro Asp Thr Phe Phe His Asn Gly Lys Lys Ser Val
125 130 135

Ala His Asn Met Thr Met Pro Asn Lys Leu Leu Arg Ile Gln Asp
140 145 150

Asp Gly Thr Leu Leu Tyr Thr Met Arg Leu Thr Val Gln Ala Glu
155 160 165

Cys Pro Met His Leu Glu Asp Phe Pro Met Asp Ala His Ser Cys
170 175 180

Pro Leu Lys Phe Gly Ser Tyr Ala Tyr Thr Thr Ser Glu Val Thr
185 190 195

Tyr Ile Trp Thr Tyr Asn Ala Ser Asp Ser Val Gln Val Ala Pro
200 205 210

Asp Gly Ser Arg Leu Asn Gln Tyr Asp Leu Leu Gly Gln Ser Ile
215 220 225

Gly Lys Glu Thr Ile Lys Ser Ser Thr Gly Glu Tyr Thr Val Met
230 235 240

Thr Ala His Phe His Leu Lys Arg Lys Ile Gly Tyr Phe Val Ile
245 250 255

Gln Thr Tyr Leu Pro Cys Ile Met Thr Val Ile Leu Ser Gln Val
260 265 270

Ser Phe Trp Leu Asn Arg Glu Ser Val Pro Ala Arg Thr Val Phe
275 280 285

Glu Lys Arg Lys Gly Phe Arg Tyr Asp Thr Glu Gln Arg Leu Cys
290 295 300

Ser Gly Cys Cys Gln Leu Cys Pro Glu Ser Phe Lys Arg Ser Ser
305 310 315

Ser Leu His His Leu Gln Glu Cys Asn His Ala Arg Thr Gln Gln
320 325 330

Glu Ala Arg Lys Gln Ala Ser
335

<210> 40 
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506954CD1 

<400> 40 

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Glu Lys Arg Lys Gly Phe Arg Tyr Asp Thr Glu Gln Arg
65 70 75

Leu Cys Ser Gly Cys Cys Gln Leu Cys Pro Glu Ser Phe Lys Arg
80 85 90

Ser Ser Ser Leu His His Leu Gln Glu Cys Asn His Ala Arg Thr
95 100 105

Gln Gln Glu Ala Arg Lys Gln Ala Ser
110

<210> 41 

<211> 400 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506956CD1 

<400> 41 

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Asp Ser Ile Thr Glu Val Phe Thr Asn Ile Tyr Val Thr
65 70 75

Ser Phe Gly Pro Val Ser Asp Thr Asp Met Glu Tyr Thr Ile Asp
80 85 90

Val Phe Phe Arg Gln Lys Trp Lys Asp Glu Arg Leu Lys Phe Lys
95 100 105

Gly Pro Met Asn Ile Leu Arg Leu Asn Asn Leu Met Ala Ser Lys
110 115 120

Ile Trp Thr Pro Asp Thr Phe Phe His Asn Gly Lys Lys Ser Val
125 130 135

Ala His Asn Met Thr Met Pro Asn Lys Leu Leu Arg Ile Gln Asp
140 145 150

Asp Gly Thr Leu Leu Tyr Thr Met Arg Leu Thr Val Gln Ala Glu
155 160 165

Cys Pro Met His Leu Glu Asp Phe Pro Met Asp Ala His Ser Cys
170 175 180

Pro Leu Lys Phe Gly Ser Tyr Ala Tyr Thr Thr Ser Glu Val Thr
185 190 195

Tyr Ile Trp Thr Tyr Asn Ala Ser Asp Ser Val Gln Val Ala Pro
200 205 210

Asp Gly Ser Arg Leu Asn Gln Tyr Asp Leu Leu Gly Gln Ser Ile
215 220 225

Gly Lys Glu Thr Ile Lys Ser Ser Thr Gly Val Thr Thr Val Leu



230 235 240 

Thr Met Thr Thr Leu Ser Ile Ser Ala Arg Asn Ser Leu Pro Lys
245 250 255

Val Ala Tyr Ala Thr Ala Met Asp Trp Phe Ile Ala Val Cys Tyr
260 265 270

Ala Phe Val Phe Ser Ala Leu Ile Glu Phe Ala Thr Val Asn Tyr
275 280 285

Phe Thr Lys Arg Gly Trp Ala Trp Asp Gly Lys Ser Val Val Asn
290 295 300

Asp Lys Lys Lys Glu Lys Ala Ser Val Met Ile Gln Asn Asn Ala
305 310 315

Tyr Ala Val Ala Val Ala Asn Tyr Ala Pro Asn Leu Ser Lys Asp
320 325 330

Pro Val Leu Ser Thr Ile Ser Lys Ser Ala Thr Thr Pro Glu Pro
335 340 345

Asn Lys Lys Pro Glu Asn Lys Pro Ala Glu Ala Lys Lys Thr Phe
350 355 360

Asn Ser Val Ser Lys Ile Asp Arg Met Ser Arg Ile Val Phe Pro
365 370 375

Val Leu Phe Gly Thr Phe Asn Leu Val Tyr Trp Ala Thr Tyr Leu
380 385 390

Asn Arg Glu Pro Val Leu Gly Val Ser Pro
395 400

<210> 42 
<211> 403 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506959CD1 

<400> 42 

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Asp Ser Ile Thr Glu Val Phe Thr Asn Ile Tyr Val Thr
65 70 75

Ser Phe Gly Pro Val Ser Asp Thr Asp Met Glu Tyr Thr Ile Asp
80 85 90

Val Phe Phe Arg Gln Lys Trp Lys Asp Glu Arg Leu Lys Phe Lys
95 100 105

Gly Pro Met Asn Ile Leu Arg Leu Asn Asn Leu Met Ala Ser Lys
110 115 120

Ile Trp Thr Pro Asp Thr Phe Phe His Asn Gly Lys Lys Ser Val
125 130 135

Ala His Asn Met Thr Met Pro Asn Lys Leu Leu Arg Ile Gln Asp
140 145 150

Asp Gly Thr Leu Leu Tyr Thr Met Arg Leu Thr Val Gln Ala Glu
155 160 165

220 225


Leu Asn Arg Glu Ser Val Pro Ala Arg Thr Val Phe Gly Val Thr
235 240

Thr Val Leu Thr Met Thr Thr Leu Ser Ile Ser Ala Arg Asn Ser
245 250 255

Leu Pro Lys Val Ala Tyr Ala Thr Ala Met Asp Trp Phe Ile Ala
260 265 270

Val Cys Tyr Ala Phe Val Phe Ser Ala Leu Ile Glu Phe Ala Thr
275 280 285

Val Asn Tyr Phe Thr Lys Arg Gly Trp Ala Trp Asp Gly Lys Ser
290 295 300

Val Val Asn Asp Lys Lys Lys Glu Lys Ala Ser Val Met Ile Gln
305 310 315

Asn Asn Ala Tyr Ala Val Ala Val Ala Asn Tyr Ala Pro Asn Leu
320 325 330

Ser Lys Asp Pro Val Leu Ser Thr Ile Ser Lys Ser Ala Thr Thr
340 345

Pro Glu Pro Asn Lys Lys Pro Glu Asn Lys Pro Ala Glu Ala Lys
350 355 360

Lys Thr Phe Asn Ser Val Ser Lys Ile Arg Met Ser Arg Ile
365 370

Val Phe Pro Val Leu Phe Gly Thr Phe Asn Leu Val Tyr Trp
380 385 390

Thr Tyr Leu Asn Arg Glu Pro Val Leu Gly Val Ser Pro
395 400






<210> 43
<211> 66
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7506960CD1

<400> 43

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly
50 55 60

Leu Gly Gly Ile Tyr Asn
65

<210> 44 
<211> 89 
<212> PRT

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510540CD1 

<400> 44 

Met Thr Glu Asp Lys Val Thr Gly Thr Leu Val Phe Thr Val Ile
1 5 10 15

Thr Ala Val Leu Gly Ser Phe Gln Phe Gly Tyr Asp Ile Gly Val
20 25 30

Ile Asn Ala Pro Gln Gln Asn Gln Ser His Val Ser Ser Lys His
35 40 45

Ser Val Ile Ser Trp Ser Ser Leu Asp Gly Val Phe Lys Ile Gly
50 55 60

Thr Ile Ser Tyr Thr Tyr Asn Cys Trp Lys Lys His Ile Arg Thr
65 70 75

Ile Leu Trp Ala Asn Phe Arg Pro Gly Ser Tyr Val Tyr Arg
80 85



<210> 45 

<211> 146 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510545CD1 



<400> 45 

Met Glu Asn Ala His Thr Lys Thr Val Glu Glu Val Leu Gly His
1 5 10 15

Phe Gly Val Asn Glu Ser Glu Ser Val Ser Val Ile Lys His Thr
20 25 30

Asp Pro Val Pro Asp Pro Arg Ala Val Asn Gln Asp Lys Lys Asn
35 40 45

Met Leu Phe Ser Val Ala Leu Ala Val Ala Ala Ile Pro Glu Gly
50 55 60

Leu Pro Ala Val Ile Thr Thr Cys Leu Ala Leu Gly Thr Arg Arg
65 70 75

Met Ala Lys Lys Asn Ala Ile Val Arg Ser Leu Pro Ser Val Glu
80 85 90

Thr Leu Gly Cys Thr Ser Val Ile Cys Ser Asp Lys Thr Gly Thr
95 100 105

Leu Thr Thr Asn Gln Met Ser Val Cys Arg Met Phe Ile Leu Asp
110 115 120

Arg Val Glu Asp His Thr Ala Glu Arg Asp Pro Val Ala Asp Gly
125 130 135

Ala Glu Asn Leu Leu Ala Arg Asp Ser His Gly
140 145



<210> 46 
<211> 353 
<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 7510654CD1 

<400> 46 

Met Thr Pro Glu Asp Pro Glu Glu Thr Gln Pro Leu Leu Gly Pro
1 5 10 15

Pro Gly Gly Ser Ala Pro Arg Gly Arg Arg Val Phe Leu Ala Ala
20 25 30

Phe Ala Ala Ala Leu Gly Pro Leu Ser Phe Gly Phe Ala Leu Gly
35 40 45

Tyr Ser Ser Pro Ala Ile Pro Ser Leu Gln Arg Ala Ala Pro Pro
50 55 60

Ala Pro Arg Leu Asp Asp Ala Ala Ala Ser Trp Phe Gly Ala Val
65 70 75

Val Thr Leu Gly Ala Ala Ala Gly Gly Val Leu Gly Gly Trp Leu
80 85 90

Val Asp Arg Ala Gly Arg Lys Leu Ser Leu Leu Leu Cys Ser Val
95 100 105

Pro Phe Val Ala Gly Phe Ala Val Ile Thr Ala Ala Gln Asp Val
110 115 120

Trp Met Leu Leu Gly Gly Arg Leu Leu Thr Gly Leu Ala Cys Gly
125 130 135

Val Ala Ser Leu Val Ala Pro Val Tyr Ile Ser Glu Ile Ala Tyr
140 145 150

Pro Ala Val Arg Gly Leu Leu Gly Ser Cys Val Gln Leu Met Val
155 160 165

Val Val Gly Ile Leu Leu Ala Tyr Leu Ala Gly Trp Val Leu Glu
170 175 180

Trp Arg Trp Leu Ala Val Leu Gly Cys Val Pro Pro Ser Leu Met
185 190 195

Leu Leu Leu Met Cys Phe Met Pro Glu Thr Pro Arg Phe Leu Leu
200 205 210

Thr Gln His Arg Arg Gln Glu Ala Met Ala Ala Leu Arg Phe Leu
215 220 225

Trp Gly Ser Glu Gln Gly Trp Glu Asp Pro Pro Ile Gly Ala Glu
230 235 240

Gln Ser Phe His Leu Ala Leu Leu Arg Gln Pro Gly Ile Tyr Lys
245 250 255

Pro Phe Ile Ile Gly Val Ser Leu Met Ala Phe Gln Gln Leu Ser
260 265 270

Gly Val Asn Ala Val Met Phe Tyr Ala Glu Thr Ile Phe Glu Glu
275 280 285

Ala Lys Phe Lys Asp Ser Ser Leu Ala Ser Val Val Val Gly Val
290 295 300

Ile Gln Val Leu Phe Thr Ala Val Ala Ala Leu Ile Met Asp Arg
305 310 315

Ala Gly Arg Arg Leu Leu Leu Val Leu Ser Gly Gly Pro Gln Ala
320 325 330

Leu Trp Ser Leu Leu Ala Cys Leu Arg Phe Leu His Leu Gln Cys
335 340 345

Pro Phe His Phe Val Leu Cys Pro
350

<210> 47 
<211> 1155 
<212> PRT

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510660CD1 



<400> 47 

Met Ala Ala Ala Ala Ala Val Gly Asn Ala Val Pro Cys Gly Ala
1 5 10 15

Arg Pro Cys Gly Val Arg Pro Asp Gly Gln Pro Lys Pro Gly Pro
20 25 30

Gln Pro Arg Ala Leu Leu Ala Ala Gly Pro Ala Leu Ile Ala Asn
35 40 45

Gly Asp Glu Leu Val Ala Ala Val Trp Pro Tyr Arg Arg Leu Ala
50 55 60

Leu Leu Arg Arg Leu Thr Val Leu Pro Phe Ala Gly Leu Leu Tyr
65 70 75

Pro Ala Trp Leu Gly Ala Ala Ala Ala Gly Cys Trp Gly Trp Gly
80 85 90

Ser Ser Trp Val Gln Ile Pro Glu Ala Ala Leu Leu Val Leu Ala
95 100 105

Thr Ile Cys Leu Ala His Ala Leu Thr Val Leu Ser Gly His Trp
110 115 120

Ser Val His Ala His Cys Ala Leu Thr Cys Thr Pro Glu Tyr Asp
125 130 135

Pro Ser Lys Ala Thr Phe Val Lys Val Val Pro Thr Pro Asn Asn
140 145 150

Gly Ser Thr Glu Leu Val Ala Leu His Arg Asn Glu Gly Glu Asp
155 160 165

Gly Leu Glu Val Leu Ser Phe Glu Phe Gln Lys Ile Lys Tyr Ser
170 175 180

Tyr Asp Ala Leu Glu Lys Lys Gln Phe Leu Pro Val Ala Phe Pro
185 190 195

Val Gly Asn Ala Phe Ser Tyr Tyr Gln Ser Asn Arg Gly Phe Gln
200 205 210

Glu Asp Ser Glu Ile Arg Ala Ala Glu Lys Lys Phe Gly Ser Asn
215 220 225

Lys Ala Glu Met Val Val Pro Asp Phe Ser Glu Leu Phe Lys Glu
230 235 240

Arg Ala Thr Ala Pro Phe Phe Val Phe Gln Val Phe Cys Val Gly
245 250 255

Leu Trp Cys Leu Asp Glu Tyr Trp Tyr Tyr Ser Val Phe Thr Leu
260 265 270

Ser Met Leu Val Ala Phe Glu Ala Ser Leu Val Gln Gln Gln Met
275 280 285

Arg Asn Met Ser Glu Ile Arg Lys Met Gly Asn Lys Pro His Met
290 295 300

Ile Gln Val Tyr Arg Ser Arg Lys Trp Arg Pro Ile Ala Ser Asp
305 310 315

Glu Ile Val Pro Gly Asp Ile Val Ser Ile Gly Arg Ser Pro Gln
320 325 330

Glu Asn Leu Val Pro Cys Asp Val Leu Leu Leu Arg Gly Arg Cys
335 340 345

Ile Val Asp Glu Ala Met Leu Thr Gly Glu Ser Val Pro Gln Met
350 355 360

Lys Glu Pro Ile Glu Asp Leu Ser Pro Asp Arg Val Leu Asp Leu
365 370 375

Gln Ala Asp Ser Arg Leu His Val Ile Phe Gly Gly Thr Lys Val
380 385 390

Val Gln His Ile Pro Pro Gln Lys Ala Thr Thr Gly Leu Lys Pro
395 400 405

Val Asp Ser Gly Cys Val Ala Tyr Val Leu Arg Thr Gly Phe Asn
410 415 420

Thr Ser Gln Gly Lys Leu Leu Arg Thr Ile Leu Phe Gly Val Lys
425 430 435

Arg Val Thr Ala Asn Asn Leu Glu Thr Phe Ile Phe Ile Leu Phe
440 445 450

Leu Leu Val Phe Ala Ile Ala Ala Ala Ala Tyr Val Trp Ile Glu
455 460 465

Gly Thr Lys Asp Pro Ser Arg Asn Arg Tyr Lys Leu Phe Leu Glu
470 475 480

Cys Thr Leu Ile Leu Thr Ser Val Val Pro Pro Glu Leu Pro Ile
485 490 495

Glu Leu Ser Leu Ala Val Asn Thr Ser Leu Ile Ala Leu Ala Lys
500 505 510

Leu Tyr Met Tyr Cys Thr Glu Pro Phe Arg Ile Pro Phe Ala Gly
515 520 525

Lys Val Glu Val Cys Cys Phe Asp Lys Thr Gly Thr Leu Thr Ser
530 535 540

Asp Ser Leu Val Val Arg Gly Val Ala Gly Leu Arg Asp Gly Lys
545 550 555

Glu Val Thr Pro Val Ser Ser Ile Pro Val Glu Thr His Arg Ala
560 565 570

Leu Ala Ser Cys His Ser Leu Met Gln Leu Asp Asp Gly Thr Leu
575 580 585

Val Gly Asp Pro Leu Glu Lys Ala Met Leu Thr Ala Val Asp Trp
590 595 600

Thr Leu Thr Lys Asp Glu Lys Val Phe Pro Arg Ser Ile Lys Thr
605 610 615

Gln Gly Leu Lys Ile His Gln Arg Phe His Phe Ala Ser Ala Leu
620 625 630

Lys Arg Met Ser Val Leu Ala Ser Tyr Glu Lys Leu Gly Ser Thr
635 640 645

Asp Leu Cys Tyr Ile Ala Ala Val Lys Gly Ala Pro Glu Thr Leu
650 655 660

His Ser Met Phe Ser Gln Cys Pro Pro Asp Tyr His His Ile His
665 670 675

Thr Glu Ile Ser Arg Glu Gly Ala Arg Val Leu Ala Leu Gly Tyr
680 685 690

Lys Glu Leu Gly His Leu Thr His Gln Gln Val Val Met Ile Thr
695 700 705

Gly Asp Asn Pro Leu Thr Ala Cys His Val Ala Gln Glu Leu His
710 715 720

Phe Ile Glu Lys Ala His Thr Leu Ile Leu Gln Pro Pro Ser Glu
725 730 735

Lys Gly Arg Gln Cys Glu Trp Arg Ser Ile Asp Gly Ser Ile Val
740 745 750

Leu Pro Leu Ala Arg Gly Ser Pro Lys Ala Leu Ala Leu Glu Tyr
755 760 765

Ala Leu Cys Leu Thr Gly Asp Gly Leu Ala His Leu Gln Ala Thr
770 775 780

Asp Pro Gln Gln Leu Leu Arg Leu Ile Pro His Val Gln Val Phe
785 790 795

Ala Arg Val Ala Pro Lys Gln Lys Glu Phe Val Ile Thr Ser Leu
800 805 810

Lys Glu Leu Gly Tyr Val Thr Leu Met Cys Gly Asp Gly Thr Asn
815 820 825

Asp Val Gly Ala Leu Lys His Ala Asp Val Gly Val Ala Leu Leu
830 835 840

Ala Asn Ala Pro Glu Arg Val Val Glu Arg Arg Arg Arg Pro Arg
845 850 855

Asp Ser Pro Thr Leu Ser Asn Ser Gly Ile Arg Ala Thr Ser Arg
860 865 870

Thr Ala Lys Gln Arg Ser Gly Leu Pro Pro Ser Glu Glu Gln Pro
875 880 885

Thr Ser Gln Arg Asp Arg Leu Ser Gln Val Leu Arg Asp Leu Glu
890 895 900

Asp Glu Ser Thr Pro Ile Val Lys Leu Gly Asp Ala Ser Ile Ala
905 910 915

Ala Pro Phe Thr Ser Lys Leu Ser Ser Ile Gln Cys Ile Cys His
920 925 930

Val Ile Lys Gln Gly Arg Cys Thr Leu Val Thr Thr Leu Gln Met
935 940 945

Phe Lys Ile Leu Ala Leu Asn Ala Leu Ile Leu Ala Tyr Ser Gln
950 955 960

Ser Val Leu Tyr Leu Glu Gly Val Lys Phe Ser Asp Phe Gln Ala
965 970 975

Thr Leu Gln Gly Leu Leu Leu Ala Gly Cys Phe Leu Phe Ile Ser
980 985 990

Arg Ser Lys Pro Leu Lys Thr Leu Ser Arg Glu Arg Pro Leu Pro
995 1000 1005

Asn Ile Phe Asn Leu Tyr Thr Ile Leu Thr Val Met Leu Gln Phe
1010 1015 1020

Phe Val His Phe Leu Ser Leu Val Tyr Leu Tyr Arg Glu Ala Gln
1025 1030 1035

Ala Arg Ser Pro Glu Lys Gln Glu Gln Phe Val Asp Leu Tyr Lys
1040 1045 1050

Glu Phe Glu Pro Ser Leu Val Asn Ser Thr Val Tyr Ile Met Ala
1055 1060 1065

Met Ala Met Gln Met Ala Thr Phe Ala Ile Asn Tyr Lys Gly Pro
1070 1075 1080

Pro Phe Met Glu Ser Leu Pro Glu Asn Lys Pro Leu Val Trp Ser
1085 1090 1095

Leu Ala Val Ser Leu Leu Ala Ile Ile Gly Leu Leu Leu Gly Ser
1100 1105 1110

Ser Pro Asp Phe Asn Ser Gln Phe Gly Leu Val Asp Ile Pro Val
1115 1120 1125

Glu Val Leu Leu Leu Asp Phe Cys Leu Ala Leu Leu Ala Asp Arg
1130 1135 1140

Val Leu Gln Phe Phe Leu Gly Thr Pro Lys Leu Lys Val Pro Ser
1145 1150 1155



<210> 48 
<211> 606 
<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 7510661CD1 

<400> 48 

Met Ala Ala Ala Ala Ala Val Gly Asn Ala Val Pro Cys Gly Ala
1 5 10 15

Arg Pro Cys Gly Val Arg Pro Asp Gly Gln Pro Lys Pro Gly Pro
20 25 30

Gln Pro Arg Ala Leu Leu Ala Ala Gly Pro Ala Leu Ile Ala Asn
35 40 45

Gly Asp Glu Leu Val Ala Ala Val Trp Pro Tyr Arg Arg Leu Ala
50 55 60



Leu Leu Arg Arg Leu Thr Val Leu Pro Phe Ala Gly Leu Leu Tyr
65 70 75

Pro Ala Trp Leu Gly Ala Ala Ala Ala Gly Cys Trp Gly Trp Gly
80 85 90

Ser Ser Trp Val Gln Ile Pro Glu Ala Ala Leu Leu Val Leu Ala
95 100 105

Thr Ile Cys Leu Ala His Ala Leu Thr Val Leu Ser Gly His Trp
110 115 120

Ser Val His Ala His Cys Ala Leu Thr Cys Thr Pro Glu Tyr Asp
125 130 135

Pro Ser Lys Ala Thr Phe Val Lys Val Val Pro Thr Pro Asn Asn
140 145 150

Gly Ser Thr Glu Leu Val Ala Leu His Arg Asn Glu Gly Glu Asp
155 160 165

Gly Leu Glu Val Leu Ser Phe Glu Phe Gln Lys Ile Lys Tyr Ser
170 175 180

Tyr Asp Ala Leu Glu Lys Lys Gln Phe Leu Pro Val Ala Phe Pro
185 190 195

Val Gly Asn Ala Phe Ser Tyr Tyr Gln Ser Asn Arg Gly Phe Gln
200 205 210

Glu Asp Ser Glu Ile Arg Ala Ala Glu Lys Lys Phe Gly Ser Asn
215 220 225

Lys Ala Glu Met Val Val Pro Asp Phe Ser Glu Leu Phe Lys Glu
230 235 240

Arg Ala Thr Ala Pro Phe Phe Val Phe Gln Val Phe Cys Val Gly
245 250 255

Leu Trp Cys Leu Asp Glu Tyr Trp Tyr Tyr Ser Val Phe Thr Leu
260 265 270

Ser Met Leu Val Ala Phe Glu Ala Ser Leu Val Gln Gln Gln Met
275 280 285

Arg Asn Met Ser Glu Ile Arg Lys Met Gly Asn Lys Pro His Met
290 295 300

Ile Gln Val Tyr Arg Ser Arg Lys Trp Arg Pro Ile Ala Ser Asp
305 310 315

Glu Ile Val Pro Gly Asp Ile Val Ser Ile Gly Arg Ser Pro Gln
320 325 330

Glu Asn Leu Val Pro Cys Asp Val Leu Leu Leu Arg Gly Arg Cys
335 340 345

Ile Val Asp Glu Ala Met Leu Thr Gly Glu Ser Val Pro Gln Met
350 355 360

Lys Glu Pro Ile Glu Asp Leu Ser Pro Asp Arg Val Leu Asp Leu
365 370 375

Gln Ala Asp Ser Arg Leu His Val Ile Phe Gly Gly Thr Lys Val
380 385 390

Val Gln His Ile Pro Pro Gln Lys Ala Thr Thr Gly Leu Lys Pro
395 400 405

Val Asp Ser Gly Cys Val Ala Tyr Val Leu Arg Thr Gly Phe Asn
410 415 420

Thr Ser Gln Gly Lys Leu Leu Arg Thr Ile Leu Phe Gly Val Lys
425 430 435

Arg Val Thr Ala Asn Asn Leu Glu Thr Phe Ile Phe Ile Leu Phe
440 445 450

Leu Leu Val Phe Ala Ile Ala Ala Ala Ala Tyr Val Trp Ile Glu
455 460 465

Gly Thr Lys Asp Pro Ser Arg Asn Arg Tyr Lys Leu Phe Leu Glu
470 475 480

Cys Thr Leu Ile Leu Thr Ser Val Val Pro Pro Glu Leu Pro Ile
485 490 495

Glu Leu Ser Leu Ala Val Asn Thr Ser Leu Ile Ala Leu Ala Lys
500 505 510

Leu Tyr Met Tyr Cys Thr Glu Pro Phe Arg Ile Pro Phe Ala Gly
515 520 525

Lys Val Glu Val Cys Cys Phe Asp Lys Thr Gly Thr Leu Thr Ser
530 535 540

Asp Ser Leu Val Val Arg Gly Val Ala Gly Leu Arg Asp Gly Lys
545 550 555

Glu Val Thr Pro Val Ser Ser Ile Pro Val Glu Thr His Arg Ala
560 565 570

Leu Ala Ser Cys His Ser Leu Met Gln Leu Asp Asp Gly Thr Leu
575 580 585

Val Gly Asp Pro Leu Glu Lys Ala Met Leu Thr Ala Val Asp Trp
590 595 600

Thr Leu Thr Lys Val Pro
605

<210> 49 
<211> 462 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7510680CD1 

<400> 49 

Met Ala Thr Lys Pro Thr Glu Pro Val Thr Ile Leu Ser Leu Arg
1 5 10 15

Lys Leu Ser Leu Gly Thr Ala Glu Pro Gln Val Lys Glu Pro Lys
20 25 30

Thr Phe Thr Val Glu Asp Ala Val Glu Thr Ile Gly Phe Gly Arg
35 40 45

Phe His Ile Ala Leu Phe Leu Ile Met Gly Ser Thr Gly Val Val
50 55 60

Glu Ala Met Glu Ile Met Leu Ile Ala Val Val Ser Pro Val Ile
65 70 75

Arg Cys Glu Trp Gln Leu Glu Asn Trp Gln Val Ala Leu Val Thr
80 85 90

Thr Met Val Phe Phe Gly Tyr Met Val Phe Ser Ile Leu Phe Gly
95 100 105

Leu Leu Ala Asp Arg Tyr Gly Arg Trp Lys Ile Leu Leu Ile Ser
110 115 120

Phe Leu Trp Gly Ala Tyr Phe Ser Leu Leu Thr Ser Phe Ala Pro
125 130 135

Ser Tyr Ile Trp Phe Val Phe Leu Arg Thr Met Val Gly Cys Gly
140 145 150

Val Ser Gly His Ser Gln Gly Leu Ile Ile Lys Thr Glu Phe Leu
155 160 165

Pro Thr Lys Tyr Arg Gly Tyr Met Leu Pro Leu Ser Gln Val Phe
170 175 180

Trp Leu Ala Gly Ser Leu Leu Ile Ile Gly Leu Ala Ser Val Ile
185 190 195

Ile Pro Thr Ile Gly Trp Arg Trp Leu Ile Arg Val Ala Ser Ile
200 205 210

Pro Gly Ile Ile Leu Ile Val Ala Phe Lys Phe Ile Pro Glu Ser
215 220 225

Ala Arg Phe Asn Val Ser Thr Gly Asn Thr Arg Ala Ala Leu Ala
230 235 240

Thr Leu Glu Arg Val Ala Lys Met Asn Arg Ser Val Met Pro Glu
245 250 255

Gly Lys Leu Val Glu Pro Val Leu Glu Lys Arg Gly Arg Phe Ala
260 265 270

Asp Leu Leu Asp Ala Lys Tyr Leu Arg Thr Thr Leu Gln Ile Trp
275 280 285

Val Ile Trp Leu Gly Ile Ser Phe Ala Tyr Tyr Gly Val Ile Leu
290 295 300

Ala Ser Ala Glu Leu Leu Glu Arg Asp Leu Val Cys Gly Ser Lys
305 310 315

Ser Asp Ser Ala Val Val Val Thr Gly Gly Asp Ser Gly Glu Ser
320 325 330

Gln Ser Pro Cys Tyr Cys His Met Phe Ala Pro Ser Asp Tyr Arg
335 340 345

Thr Met Ile Ile Ser Thr Ile Gly Glu Ile Ala Leu Asn Pro Leu
350 355 360

Asn Ile Leu Gly Ile Asn Phe Leu Gly Arg Arg Leu Ser Leu Ser
365 370 375

Ile Thr Met Gly Cys Thr Ala Leu Phe Phe Leu Leu Leu Asn Ile
380 385 390

Cys Thr Ser Ser Ala Gly Leu Ile Gly Phe Leu Phe Met Leu Arg
395 400 405

Ala Leu Val Ala Ala Asn Phe Asn Thr Val Tyr Ile Tyr Thr Ala
410 415 420

Glu Val Leu Met Ser Ala Ser Ile Leu Gly Ala Leu Cys Leu Phe
425 430 435

Ser Ser Val Cys Val Val Cys Ala Ile Ser Ala Phe Thr Leu Pro
440 445 450

Ile Glu Thr Lys Gly Arg Ala Leu Gln Gln Ile Lys
455 460



<210> 50 
<211> 366 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7505145CD1



<400> 50 



Met Gly Trp Gly Gly Gly Gly Gly Cys Thr Pro Arg Pro Pro Ile
1 5 10 15

His Gln Gln Pro Pro Glu Arg Arg Val Val Thr Val Val Phe Leu
20 25 30

Gly Leu Leu Leu Asp Leu Leu Ala Phe Thr Leu Leu Leu Pro Leu
35 40 45

Leu Pro Gly Leu Leu Glu Ser His Gly Arg Ala His Asp Pro Leu
50 55 60

Tyr Gly Ser Trp Gln Gly Gly Val Asp Trp Phe Ala Thr Ala Ile
65 70 75

Gly Met Pro Val Glu Lys Arg Tyr Asn Ser Val Leu Phe Gly Gly
80 85 90

Leu Ile Gly Ser Ala Phe Ser Val Leu Gln Phe Leu Cys Ala Pro
95 100 105

Leu Thr Gly Ala Thr Ser Asp Cys Leu Gly Arg Arg Pro Val Met
110 115 120

Leu Leu Cys Leu Met Gly Val Ala Thr Ser Tyr Ala Val Trp Ala
125 130 135

Thr Ser Arg Ser Phe Ala Ala Phe Leu Ala Ser Arg Leu Ile Gly
140 145 150

Gly Ile Ser Lys Gly Asn Val Ser Leu Ser Thr Ala Ile Val Ala
155 160 165

Asp Leu Gly Ser Pro Leu Ala Arg Ser Gln Gly Met Ala Val Ile
170 175 180

Gly Val Ala Phe Ser Leu Gly Phe Thr Leu Gly Pro Met Leu Gly
185 190 195

Ala Ser Leu Pro Leu Glu Met Ala Pro Trp Phe Ala Leu Leu Phe
200 205 210

Ala Ala Ser Asp Leu Leu Phe Ile Phe Cys Phe Leu Pro Glu Thr
215 220 225

Leu Pro Leu Glu Lys Arg Ala Pro Ser Ile Ala Leu Gly Phe Arg
230 235 240

Asp Ala Ala Asp Leu Leu Ser Pro Leu Ala Leu Leu Arg Phe Ser
245 250 255

Ala Val Ala Arg Gly Gln Asp Pro Pro Ser Gly Asp Arg Leu Ser
260 265 270

Ser Leu Arg Arg Leu Gly Leu Val Tyr Phe Leu Tyr Leu Phe Leu
275 280 285

Phe Ser Gly Leu Glu Tyr Thr Leu Ser Phe Leu Thr His Gln Arg
290 295 300

Phe Gln Phe Ser Arg Pro Ser Cys Cys Trp Cys Pro Pro Ser Ser
305 310 315

Ser Ser Ala Gly Asp Val Leu Cys Pro Cys Trp Ala Trp Gly Cys
320 325 330

Cys Ser Thr Pro Leu Pro Pro Pro Leu Trp Cys Pro Ala Cys Pro
335 340 345

Pro Trp Ser Leu Ala Met Ala His Gln Gly Arg Arg Ala Arg Ser
350 355 360

Trp Val His Cys Ala Ala
365







<210> 51 
<211> 295 
<212> PRT

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7505162CD1 

<400> 51 

Met Ala Ala Gln Gly Tyr Gly Tyr Tyr Arg Thr Val Ile Phe Ser
1 5 10 15

Ala Met Phe Gly Gly Tyr Ser Leu Tyr Tyr Phe Asn Arg Lys Thr
20 25 30

Phe Ser Phe Val Met Pro Ser Leu Val Glu Glu Ile Pro Leu Asp
35 40 45

Lys Asp Asp Leu Gly Phe Ile Thr Ser Ser Gln Ser Ala Ala Tyr
50 55 60

Ala Ile Ser Lys Phe Val Ser Gly Val Leu Ser Asp Gln Met Ser
65 70 75

Ala Arg Trp Leu Phe Ser Ser Gly Leu Leu Leu Val Gly Leu Val
80 85 90

Asn Ile Phe Phe Ala Trp Ser Ser Thr Val Pro Val Phe Ala Ala
95 100 105

Leu Trp Phe Leu Asn Gly Leu Ala Gln Gly Leu Gly Trp Pro Pro
110 115 120

Cys Gly Lys Val Leu Arg Lys Trp Phe Glu Pro Ser Gln Phe Gly
125 130 135

Thr Trp Trp Ala Ile Leu Ser Thr Ser Met Asn Leu Ala Gly Gly
140 145 150

Leu Gly Pro Ile Leu Ala Thr Ile Leu Ala Gln Ser Tyr Ser Trp
155 160 165

Arg Ser Thr Leu Ala Leu Ser Gly Ala Leu Cys Val Val Val Ser
170 175 180

Phe Leu Cys Leu Leu Leu Ile His Asn Glu Pro Ala Asp Val Gly
185 190 195

Leu Arg Asn Leu Asp Pro Met Pro Ser Glu Gly Lys Lys Gly Ser
200 205 210

Leu Lys Glu Glu Ser Thr Leu Gln Glu Leu Leu Leu Ser Pro Tyr
215 220 225

Leu Trp Val Leu Ser Thr Gly Tyr Leu Val Val Phe Gly Val Lys
230 235 240

Thr Cys Cys Thr Asp Trp Gly Gln Phe Phe Leu Ile Gln Glu Lys
245 250 255

Gly Gln Ser Ala Leu Val Gly Gly Thr Val Gln Leu Arg Glu Pro
260 265 270

Ser Pro Trp Pro Val Ala Val His Asp Gly Trp His Asp Ser Val
275 280 285

His Val Pro Leu Pro Gly Asn Ser Asp Gln
290 295

<210> 52
<211> 229
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7505469CD1

<400> 52

Met Glu Ala Arg Glu Pro Gly Arg Pro Thr Pro Thr Tyr His Leu
1 5 10 15

Val Pro Asn Thr Ser Gln Ser Gln Val Glu Glu Asp Val Ser Ser
20 25 30

Pro Pro Gln Arg Ser Ser Glu Thr Met Gln Leu Lys Lys Glu Ile
35 40 45

Ser Leu Leu Asn Gly Val Ser Leu Val Val Gly Asn Met Ile Gly
50 55 60

Ser Gly Ile Phe Val Ser Pro Lys Gly Val Leu Val His Thr Ala
65 70 75

Ser Tyr Gly Met Ser Leu Ile Val Trp Ala Ile Gly Gly Leu Phe
80 85 90

Ser Val Val Gly Ala Leu Cys Tyr Ala Glu Leu Gly Thr Thr Ile
95 100 105

Thr Lys Ser Gly Ala Ser Tyr Ala Tyr Ile Leu Glu Ala Phe Gly
110 115 120

Gly Phe Ile Ala Phe Ile Arg Leu Trp Val Ser Leu Leu Val Val
125 130 135

Glu Pro Thr Gly Gln Ala Ile Ile Ala Ile Thr Phe Ala Asn Tyr
140 145 150

Ile Ile Gln Pro Ser Phe Pro Ser Cys Asp Pro Pro Tyr Leu Ala
155 160 165

Cys Arg Leu Leu Ala Ala Ala Cys Ile Cys Leu Leu Thr Phe Val
170 175 180

Asn Cys Ala Tyr Val Lys Trp Gly Thr Arg Val Gln Asp Thr Phe
185 190 195

Thr Tyr Ala Lys Val Val Ala Leu Ile Ala Ile Ile Val Met Gly
200 205 210

Leu Val Lys Leu Cys Gln Glu Ile Cys Pro Trp Pro Leu Gly Phe
215 220 225

Leu Cys Gln Leu



<210> 53 

<211> 637 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505475CD1 

<400> 53 

Met Asn Met Lys Gln Lys Ser Val Tyr Gln Gln Thr Lys Ala Leu
1 5 10 15

Leu Cys Lys Asn Phe Leu Lys Lys Trp Arg Met Lys Arg Glu Ser
20 25 30

Leu Leu Glu Trp Gly Leu Ser Ile Leu Leu Gly Leu Cys Ile Ala
35 40 45

Leu Phe Ser Ser Ser Met Arg Asn Val Gln Phe Pro Gly Met Ala
50 55 60

Pro Gln Asn Leu Gly Arg Val Asp Lys Phe Asn Ser Ser Ser Leu
65 70 75

Met Val Val Tyr Thr Pro Ile Ser Asn Leu Thr Gln Gln Ile Met
80 85 90

Asn Lys Thr Ala Leu Ala Pro Leu Leu Lys Gly Thr Ser Val Ile
95 100 105

Gly Ala Pro Asn Lys Thr His Met Asp Glu Ile Leu Leu Glu Asn
110 115 120

Leu Pro Tyr Ala Met Gly Ile Ile Phe Asn Glu Thr Phe Ser Tyr
125 130 135

Lys Leu Ile Phe Phe Gln Gly Tyr Asn Ser Pro Leu Trp Lys Glu
140 145 150

Asp Phe Ser Ala His Cys Trp Asp Gly Tyr Gly Glu Phe Ser Cys
155 160 165

Thr Leu Thr Lys Tyr Trp Asn Arg Gly Phe Val Ala Leu Gln Thr
170 175 180

Ala Ile Asn Thr Ala Ile Ile Glu Ile Thr Thr Asn His Pro Val
185 190 195

Met Glu Glu Leu Met Ser Val Thr Ala Ile Thr Met Lys Thr Leu
200 205 210

Pro Phe Ile Thr Lys Asn Leu Leu His Asn Glu Met Phe Ile Leu
215 220 225

Phe Phe Leu Leu His Phe Ser Pro Leu Val Tyr Phe Ile Ser Leu
230 235 240

Asn Val Thr Lys Glu Arg Lys Lys Ser Lys Asn Leu Met Lys Met
245 250 255

Met Gly Leu Gln Asp Ser Ala Phe Trp Leu Ser Trp Gly Leu Ile
260 265 270

Tyr Ala Gly Phe Ile Phe Ile Ile Ser Ile Phe Ile Thr Ile Ile
275 280 285

Ile Thr Phe Thr Gln Ile Ile Val Met Thr Gly Phe Met Val Ile
290 295 300

Phe Ile Pro Phe Phe Leu Tyr Gly Leu Ser Leu Val Ala Leu Val
305 310 315

Phe Leu Leu Ser Val Leu Leu Lys Lys Ala Val Leu Thr Asn Leu
320 325 330

Val Val Phe Leu Leu Thr Leu Phe Trp Gly Cys Leu Gly Phe Thr
335 340 345

Val Phe Tyr Glu Gln Leu Pro Ser Ser Leu Glu Trp Ile Leu Asn
350 355 360

Ile Cys Ser Pro Phe Ala Phe Thr Thr Gly Met Ile Gln Ile Ile
365 370 375

Lys Leu Asp Tyr Asn Leu Asn Gly Val Ile Phe Pro Asp Pro Ser
380 385 390

Gly Asp Ser Tyr Thr Met Ile Ala Thr Phe Ser Met Leu Leu Leu
395 400 405

Asp Gly Leu Ile Tyr Leu Leu Leu Ala Leu Tyr Phe Asp Lys Ile
410 415 420

Leu Pro Tyr Gly Asp Glu Arg His Tyr Ser Pro Leu Phe Phe Leu
425 430 435

Asn Ser Ser Ser Cys Phe Gln His Gln Arg Thr Asn Ala Lys Val
440 445 450

Ile Glu Lys Glu Ile Asp Ala Glu His Pro Ser Asp Asp Tyr Phe
455 460 465

Glu Pro Val Ala Pro Glu Phe Gln Gly Lys Glu Ala Ile Arg Ile
470 475 480

Arg Asn Val Lys Lys Glu Tyr Lys Gly Lys Ser Gly Lys Val Glu
485 490 495

Ala Leu Lys Gly Leu Leu Phe Asp Ile Tyr Glu Gly Gln Ile Thr
500 505 510

Ala Ile Leu Gly His Ser Gly Ala Gly Lys Ser Ser Leu Leu Asn
515 520 525

Ile Leu Asn Gly Leu Ser Val Pro Thr Glu Gly Ser Val Thr Ile
530 535 540

Tyr Asn Lys Asn Leu Ser Glu Met Gln Asp Leu Glu Glu Ile Arg
545 550 555

Lys Ile Thr Gly Val Cys Pro Gln Phe Asn Val Gln Phe Asp Ile
560 565 570

Leu Thr Val Lys Glu Asn Leu Ser Leu Phe Ala Lys Ile Lys Gly
575 580 585

Ile His Leu Lys Glu Val Glu Gln Glu Val Gln Arg Ile Leu Leu
590 595 600

Glu Leu Asp Met Gln Asn Ile Gln Asp Asn Leu Ala Lys His Leu
605 610 615

Ser Glu Gly Gln Lys Arg Lys Leu Thr Phe Gly Ile Thr Ile Leu
620 625 630

Gly Asp Pro Gln Ile Glu Lys
635

<210> 54
<211> 90
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7505568CD1

<400> 54

Met Ala Arg Lys Gln Asn Arg Asn Ser Lys Glu Leu Gly Leu Val
1 5 10 15

Pro Leu Thr Asp Asp Thr Ser His Ala Gly Pro Pro Gly Pro Gly
20 25 30

Arg Ala Leu Leu Glu Cys Asp His Leu Arg Ser Gly Val Pro Gly
35 40 45

Gly Arg Arg Arg Lys Asp Trp Ser Cys Ser Leu Leu Val Ala Ser
50 55 60

Leu Ala Gly Ala Phe Gly Ser Ser Phe Leu Tyr Gly Tyr Asn Leu
65 70 75

Ser Val Val Asn Ala Pro Thr Pro Glu Ala His Phe Ala Gly Gln
80 85 90

<210> 55
<211> 327
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7506953CD1

<400> 55

Met Lys Thr Lys Leu Asn Ile Tyr Asn Met Gln Phe Leu Leu Phe
1 5 10 15

Val Phe Leu Val Trp Asp Pro Ala Arg Leu Val Leu Ala Asn Ile
20 25 30

Gln Glu Asp Glu Ala Lys Asn Asn Ile Thr Ile Phe Thr Arg Ile
35 40 45 

Leu Asp Arg Leu Leu Asp Gly Tyr Asp Asn Arg Leu Arg Pro Gly 
50 55 60 

Leu Gly Asp Ala Tyr Thr Thr Ser Glu Val Thr Tyr Ile Trp Thr
65 70 75

Tyr Asn Ala Ser Asp Ser Val Gln Val Ala Pro Asp Gly Ser Arg
80 85 90

Leu Asn Gln Tyr Asp Leu Leu Gly Gln Ser Ile Gly Lys Glu Thr
95 100 105

Ile Lys Ser Ser Thr Gly Glu Tyr Thr Val Met Thr Ala His Phe
110 115 120

His Leu Lys Arg Lys Ile Gly Tyr Phe Val Ile Gln Thr Tyr Leu
125 130 135

Pro Cys Ile Met Thr Val Ile Leu Ser Gln Val Ser Phe Trp Leu
140 145 150 

Asn Arg Glu Ser Val Pro Ala Arg Thr Val Phe Gly Val Thr Thr 
155 160 165 

Val Leu Thr Met Thr Thr Leu Ser Ile Ser Ala Arg Asn Ser Leu
170 175 180

Pro Lys Val Ala Tyr Ala Thr Ala Met Asp Trp Phe Ile Ala Val
185 190 195

Cys Tyr Ala Phe Val Phe Ser Ala Leu Ile Glu Phe Ala Thr Val
200 205 210 

Asn Tyr Phe Thr Lys Arg Gly Trp Ala Trp Asp Gly Lys Ser Val 
215 220 225 

Val Asn Asp Lys Lys Lys Glu Lys Ala Ser Val Met Ile Gln Asn
230 235 240 

Asn Ala Tyr Ala Val Ala Val Ala Asn Tyr Ala Pro Asn Leu Ser 
245 250 255 

Lys Asp Pro Val Leu Ser Thr Ile Ser Lys Ser Ala Thr Thr Pro
260 265 270 

Glu Pro Asn Lys Lys Pro Glu Asn Lys Pro Ala Glu Ala Lys Lys 
275 280 285 

Thr Phe Asn Ser Val Ser Lys Ile Asp Arg Met Ser Arg Ile Val
290 295 300 

Phe Pro Val Leu Phe Gly Thr Phe Asn Leu Val Tyr Trp Ala Thr 
305 310 315 

Tyr Leu Asn Arg Glu Pro Val Leu Gly Val Ser Pro 
320 325 

<210> 56 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7510176CD1

<400> 56 

Met Lys Phe Phe Ser Tyr Ile Leu Val Tyr Arg Arg Phe Leu Phe
1 5 10 15

Val Val Phe Thr Val Leu Val Leu Leu Pro Leu Pro Ile Val Leu
20 25 30

His Thr Lys Leu Ile Leu Thr Phe Pro Arg
35 40



<210> 57 

<211> 104 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510541CD1 



<400> 57 

Met Glu Ala Pro Leu Gln Thr Glu Met Val Glu Leu Val Pro Asn
1 5 10 15

Gly Lys His Ser Glu Gly Leu Leu Pro Val Ile Thr Pro Met Ala
20 25 30

Gly Asn Gln Arg Val Glu Asp Pro Ala Arg Ser Cys Met Glu Gly
35 40 45

Lys Ser Phe Leu Gln Lys Ser Pro Ser Lys Glu Pro His Phe Thr
50 55 60

Asp Phe Glu Gly Lys Thr Ser Phe Gly Met Ser Val Phe Asn Leu
65 70 75

Ser Asn Ala Ile Met Gly Ser Gly Ile Leu Gly Leu Ala Tyr Ala
80 85 90

Met Ala Asn Thr Gly Ile Ile Leu Phe Leu His Pro Cys Leu
95 100



<210> 58 

<211> 296 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510923CD1 



<400> 58 

Met Glu Ala Pro Leu Gln Thr Glu Met Val Glu Leu Val Pro Asn
1 5 10 15

Gly Lys His Ser Glu Gly Leu Leu Pro Val Ile Thr Pro Met Ala
20 25 30

Gly Asn Gln Arg Val Glu Asp Pro Ala Arg Ser Cys Met Glu Gly
35 40 45

Lys Ser Phe Leu Gln Lys Ser Pro Ser Lys Glu Pro His Phe Thr
50 55 60

Asp Phe Glu Gly Lys Thr Ser Phe Gly Met Ser Val Phe Asn Leu
65 70 75

Ser Asn Ala Ile Met Gly Ser Gly Ile Leu Gly Leu Ala Tyr Ala
80 85 90

Met Ala Asn Thr Gly Ile Ile Leu Phe Leu Phe Leu Leu Thr Ala
95 100 105

Val Ala Leu Leu Ser Ser Tyr Ser Ile His Leu Leu Leu Lys Ser
110 115 120

Ser Gly Val Val Gly Ile Arg Ala Tyr Glu Gln Leu Gly Tyr Arg
125 130 135

Ala Phe Gly Thr Pro Gly Lys Leu Ala Ala Ala Leu Ala Ile Thr
140 145 150

Leu Gln Asn Ile Gly Ala Met Ser Ser Tyr Leu Tyr Ile Ile Lys
155 160 165

Ser Glu Leu Pro Leu Val Ile Gln Thr Phe Leu Asn Leu Glu Glu
170 175 180

Lys Thr Ser Asp Trp Tyr Met Asn Gly Asn Tyr Leu Val Ile Leu
185 190 195

Val Ser Val Thr Ile Ile Leu Pro Leu Ala Leu Met Arg Gln Leu
200 205 210

Gly Tyr Leu Gly Tyr Ser Ser Gly Phe Ser Leu Ser Cys Met Val
215 220 225

Phe Phe Leu Ile Ala Val Ile Tyr Lys Lys Phe His Val Pro Cys
230 235 240

Pro Leu Pro Pro Asn Phe Asn Asn Thr Thr Gly Asn Phe Ser His
245 250 255

Val Glu Ile Val Lys Glu Lys Val Gln Leu Gln Val Glu Pro Glu
260 265 270

Ala Ser Ala Phe Cys Thr Pro Ser Tyr Phe Thr Leu Asn Ser Gln
275 280 285

Val Leu Thr Gly Gln Gly Lys Ala Gly Ala Gln
290 295



<210> 59 

<211> 1364 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510984CD1 



<400> 59 

Met Pro Leu Ala Phe Cys Gly Ser Glu Asn His Ser Ala Ala Tyr
1 5 10 15

Arg Val Asp Gln Gly Val Leu Asn Asn Gly Cys Phe Val Asp Ala
20 25 30

Leu Asn Val Val Pro His Val Phe Leu Leu Phe Ile Thr Phe Pro
35 40 45

Ile Leu Phe Ile Gly Trp Gly Ser Gln Ser Ser Lys Val His Ile
50 55 60

His His Ser Thr Trp Leu His Phe Pro Gly His Asn Leu Arg Trp
65 70 75

Ile Leu Thr Phe Met Leu Leu Phe Val Leu Val Cys Glu Ile Ala
80 85 90

Glu Gly Ile Leu Ser Asp Gly Val Thr Glu Ser His His Leu His
95 100 105

Leu Tyr Met Pro Ala Gly Met Ala Phe Met Ala Ala Val Ala Ser
110 115 120

Val Val Tyr Tyr His Asn Ile Glu Thr Ser Asn Phe Pro Lys Leu
125 130 135

Leu Ile Ala Leu Leu Val Tyr Trp Thr Leu Ala Phe Ile Thr Lys
140 145 150

Thr Ile Lys Phe Val Lys Phe Leu Asp His Ala Ile Gly Phe Ser
155 160 165

Gln Leu Arg Phe Cys Leu Thr Gly Leu Leu Val Ile Leu Tyr Gly
170 175 180

Met Leu Leu Leu Val Glu Val Asn Val Ile Arg Val Arg Arg Tyr
185 190 195

Ile Phe Phe Lys Thr Pro Arg Glu Val Lys Pro Pro Glu Asp Leu
200 205 210

Gln Asp Leu Gly Val Arg Phe Leu Gln Pro Phe Val Asn Leu Leu
215 220 225

Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Phe Ile Lys Thr Ala
230 235 240

His Lys Lys Pro Ile Asp Leu Arg Ala Ile Gly Lys Leu Pro Ile
245 250 255

Ala Met Arg Ala Leu Thr Asn Tyr Gln Arg Leu Cys Glu Ala Phe
260 265 270

Asp Ala Gln Val Arg Lys Asp Ile Gln Gly Thr Gln Gly Ala Arg
275 280 285

Ala Ile Trp Gln Ala Leu Ser His Ala Phe Gly Arg Arg Leu Val
290 295 300

Leu Ser Ser Thr Phe Arg Ile Leu Ala Asp Leu Leu Gly Phe Ala
305 310 315

Gly Pro Leu Cys Ile Phe Gly Ile Val Asp His Leu Gly Lys Glu
320 325 330

Asn Asp Val Phe Gln Pro Lys Thr Gln Phe Leu Gly Val Tyr Phe
335 340 345

Val Ser Ser Gln Glu Phe Leu Ala Asn Ala Tyr Val Leu Ala Val
350 355 360

Leu Leu Phe Leu Ala Leu Leu Leu Gln Arg Thr Phe Leu Gln Ala
365 370 375

Ser Tyr Tyr Val Ala Ile Glu Thr Gly Ile Asn Leu Arg Gly Ala
380 385 390

Ile Gln Thr Lys Ile Tyr Asn Lys Ile Met His Leu Ser Thr Ser
395 400 405

Asn Leu Ser Met Gly Glu Met Thr Ala Gly Gln Ile Cys Asn Leu
410 415 420

Val Ala Ile Asp Thr Asn Gln Leu Met Trp Phe Phe Phe Leu Cys
425 430 435

Pro Asn Leu Trp Ala Met Pro Val Gln Ile Ile Val Gly Val Ile
440 445 450

Leu Leu Tyr Tyr Ile Leu Gly Val Ser Ala Leu Ile Gly Ala Ala
455 460 465

Val Ile Ile Leu Leu Ala Pro Val Gln Tyr Phe Val Ala Thr Lys
470 475 480

Leu Ser Gln Ala Gln Arg Ser Thr Leu Glu Tyr Ser Asn Glu Arg
485 490 495

Leu Lys Gln Thr Asn Glu Met Leu Arg Gly Ile Lys Leu Leu Lys
500 505 510

Leu Tyr Ala Trp Glu Asn Ile Phe Arg Thr Arg Val Glu Thr Thr
515 520 525

Arg Arg Lys Glu Met Thr Ser Leu Arg Ala Phe Ala Ile Tyr Thr
530 535 540

Ser Ile Ser Ile Phe Met Asn Thr Ala Ile Pro Ile Ala Ala Val
545 550 555

Leu Ile Thr Phe Val Gly His Val Ser Phe Phe Lys Glu Ala Asp
560 565 570

Phe Ser Pro Ser Val Ala Phe Ala Ser Leu Ser Leu Phe His Ile
575 580 585

Leu Val Thr Pro Leu Phe Leu Leu Ser Ser Val Val Arg Ser Thr
590 595 600

Val Lys Ala Leu Val Ser Val Gln Lys Leu Ser Glu Phe Leu Ser
605 610 615

Ser Ala Glu Ile Arg Glu Glu Gln Cys Ala Pro His Glu Pro Thr
620 625 630

Pro Gln Gly Pro Ala Ser Lys Tyr Gln Ala Val Pro Leu Arg Val
635 640 645

Val Asn Arg Lys Arg Pro Ala Arg Glu Asp Cys Arg Gly Leu Thr
650 655 660

Gly Pro Leu Gln Ser Leu Val Pro Ser Ala Asp Gly Asp Ala Asp
665 670 675

Asn Cys Cys Val Gln Ile Met Gly Gly Tyr Phe Thr Trp Thr Pro
680 685 690

Asp Gly Ile Pro Thr Leu Ser Asn Ile Thr Ile Arg Ile Pro Arg
695 700 705

Gly Gln Leu Thr Met Ile Val Gly Gln Val Gly Cys Gly Lys Ser
710 715 720

Ser Leu Leu Leu Ala Ala Leu Gly Glu Met Gln Lys Val Ser Gly
725 730 735

Ala Val Phe Trp Ser Ser Ser Leu Pro Asp Ser Glu Ile Gly Glu
740 745 750

Asp Pro Ser Pro Glu Arg Glu Thr Ala Thr Asp Leu Asp Ile Arg
755 760 765

Lys Arg Gly Pro Val Ala Tyr Ala Ser Gln Lys Pro Trp Leu Leu
770 775 780

Asn Ala Thr Val Glu Glu Asn Ile Ile Phe Glu Ser Pro Phe Asn
785 790 795

Lys Gln Arg Tyr Lys Met Val Ile Glu Ala Cys Ser Leu Gln Pro
800 805 810

Asp Ile Asp Ile Leu Pro His Gly Asp Gln Thr Gln Ile Gly Glu
815 820 825

Arg Gly Ile Asn Leu Ser Gly Gly Gln Arg Gln Arg Ile Ser Val
830 835 840

Ala Arg Ala Leu Tyr Gln His Ala Asn Val Val Phe Leu Asp Asp
845 850 855

Pro Phe Ser Ala Leu Asp Ile His Leu Ser Asp His Leu Met Gln
860 865 870

Ala Gly Ile Leu Glu Leu Leu Arg Asp Asp Lys Arg Thr Val Val
875 880 885

Leu Val Thr His Lys Leu Gln Tyr Leu Pro His Ala Asp Trp Ile
890 895 900

Ile Ala Met Lys Asp Gly Thr Ile Gln Arg Glu Gly Thr Leu Lys
905 910 915

Asp Phe Gln Arg Ser Glu Cys Gln Leu Phe Glu His Trp Lys Thr
920 925 930

Leu Met Asn Arg Gln Asp Gln Glu Leu Glu Lys Glu Thr Val Thr
935 940 945

Glu Arg Lys Ala Thr Glu Pro Pro Gln Gly Leu Ser Arg Ala Met
950 955 960

Ser Ser Arg Asp Gly Leu Leu Gln Asp Glu Glu Glu Glu Glu Glu
965 970 975

Glu Ala Ala Glu Ser Glu Glu Asp Asp Asn Leu Ser Ser Met Leu
980 985 990

His Gln Arg Ala Glu Ile Pro Trp Arg Ala Cys Ala Lys Tyr Leu
995 1000 1005

Ser Ser Ala Gly Ile Leu Leu Leu Ser Leu Leu Val Phe Ser Gln
1010 1015 1020

Leu Leu Lys His Met Val Leu Val Ala Ile Asp Tyr Trp Leu Ala
1025 1030 1035

Lys Trp Thr Asp Ser Ala Leu Thr Leu Thr Pro Ala Ala Arg Asn
1040 1045 1050

Cys Ser Leu Ser Gln Glu Cys Thr Leu Asp Gln Thr Val Tyr Ala
1055 1060 1065

Met Val Phe Thr Val Leu Cys Ser Leu Gly Ile Val Leu Cys Leu
1070 1075 1080

Val Thr Ser Val Thr Val Glu Trp Thr Gly Leu Lys Val Ala Lys
1085 1090 1095

Arg Leu His Arg Ser Leu Leu Asn Arg Ile Ile Leu Ala Pro Met
1100 1105 1110

Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser Ile Leu Asn Arg Phe
1115 1120 1125

Ser Ser Asp Cys Asn Thr Ile Asp Gln His Ile Pro Ser Thr Leu
1130 1135 1140

Glu Cys Leu Ser Arg Ser Thr Leu Leu Cys Val Ser Ala Leu Ala
1145 1150 1155

Val Ile Ser Tyr Val Thr Pro Val Phe Leu Val Ala Leu Leu Pro
1160 1165 1170

Leu Ala Ile Val Cys Tyr Phe Ile Gln Lys Tyr Phe Arg Val Ala
1175 1180 1185

Ser Arg Asp Leu Gln Gln Leu Asp Asp Thr Thr Gln Leu Pro Leu
1190 1195 1200

Leu Ser His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr Ile Arg
1205 1210 1215

Ala Phe Arg Tyr Glu Ala Arg Phe Gln Gln Lys Leu Leu Glu Tyr
1220 1225 1230

Thr Asp Ser Asn Asn Ile Ala Ser Leu Phe Leu Thr Ala Ala Asn
1235 1240 1245

Arg Trp Leu Glu Val Arg Met Glu Tyr Ile Gly Ala Cys Val Val
1250 1255 1260

Leu Ile Ala Ala Val Thr Ser Ile Ser Asn Ser Leu His Arg Glu
1265 1270 1275

Leu Ser Ala Gly Leu Val Gly Leu Gly Leu Thr Tyr Ala Leu Met
1280 1285 1290

Val Ser Asn Tyr Leu Asn Trp Met Val Arg Asn Leu Ala Asp Met
1295 1300 1305

Glu Leu Gln Leu Gly Ala Val Lys Arg Ile His Gly Leu Leu Lys
1310 1315 1320

Thr Glu Ala Glu Ser Tyr Glu Gly Leu Leu Gly Glu Arg Leu Arg
1325 1330 1335

Glu Arg Gly Gly Glu Glu Ser Lys Glu Glu Cys Val Trp Val Gly
1340 1345 1350

Gly His Lys Gly Ala Trp Gly Trp Gly Gly Thr Phe Gly Tyr
1355 1360

<210> 60
<211> 895
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7509332CB1 

<400> 60 

gagcacggag attcggcacg aggaggaaaa gcagcaaacc agatggagaa ggaaggccaa 60 

aaagataagt agaaagctgg accatgagct ccttgagggc agggactgaa tggttactat 120 

gtccccaggg cccagcatga ccttctcctg gattcctcat cttccttctg tgacctgtgt 180 

ctccatcagt ttctcctccg gcatcttttt cttacaggat tcttacctca ggaatagatg 240 

gacatggcct ggcagatgat gcagctgctg cttctggctt tggtgactgc tgcggggagt 300 

gcccagccca ggagtgcgcg ggccaggacg gacctgctca atgtctgcat gaacgccaag 360 

caccacaaga cacagcccag ccccgaggac gagctgtatg gccagtgcag tccctggaag 420 

aagaatgcct gctgcacggc cagcaccagc caggagctgc acaaggacac ctcccgcctg 480 

tacaacttta actgggatca ctgtgagcgc tggtgggagg actgtcgcac ctcctacacc 540 

tgcaaaagca actggcacaa aggctggaat tggacctcag ggattaatga gtgtccggcc 600 

ggggccctct gcagcacctt tgagtcctac ttccccactc cagccgccct ttgtgaaggc 660 

ctctggagcc actccttcaa ggtcagcaac tatagtcgag ggagcggccg ctgcatccag 720 

atgtggtttg actcagccca gggcaacccc aatgaggagg tggccaagtt ctatgctgcg 780 

gccatgaatg ctggggcccc gtctcgtggg attattgatt cctgatccaa gaagggtcct 840 

ctggggttct tccaacaacc tattctaata gacaaatcca catgaaaaaa aaaaa 895 

<210> 61 

<211> 1623 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509102CB1 

<400> 61 

cagcaaagcc ctgacctgac ctcctgatgc tcaggagaag ccatgggccc ctcctgtcct 60 

gtgttcctgt ccttcacaaa gctcggcctg tggtggctcc ttctgacccc agcaggtgga 120 

gaggaagcta agcgcccacc tcccagggct cctggagacc cactctcctc tcccagtccc 180 

acggcattgc cgcagggagg ctcgcatacc gagactgagg accggctctt caaacacctc 240 
ttccggggct acaaccgctg ggcgcgcccg gtgcccaaca cttcagacgt ggatgagaag 300 

aaccaaatga tgaccaccaa cgtctggcta aaacaggagt ggagcgacta caaactgcgc 360 

tggaacccca ctgattttgg caacatcaca tctctcaggg tcccttctga gatgatctgg 420 

atccccgaca ttgttctcta caacaaaact gcaagatgaa gtttggctcc tggacttatg 480 

acaaggccaa gatcgacctg gagcagatgg agcagactgt ggacctgaag gactactggg 540 

agagcggcga gtgggccatc gtcaatgcca cgggcaccta caacagcaag aagtacgact 600 

gctgcgccga gatctacccc gacgtcacct acgccttcgt catccggcgg ctgccgctct 660 

tctacaccat caacctcatc atcccctgcc tgctcatctc ctgcctcact gtgctggtct 720 

tctacctgcc ctccgactgc ggcgagaaga tcacgctgtg catttcggtg ctgctgtcac 780 

tcaccgtctt cctgctgctc atcactgaga tcatcccgtc cacctcgctg gtcatcccgc 840 

tcatcggcga gtacctgctg ttcaccatga tcttcgtcac cctgtccatc gtcatcaccg 900 

tcttcgtgct caatgtgcac caccgctccc ccagcaccca caccatgccc cactgggtgc 960 

ggggggccct tctgggctgt gtgccccggt ggcttctgat gaaccggccc ccaccacccg 1020 

tggagctctg ccacccccta cgcctgaagc tcagcccctc ttatcactgg ctggagagca 1080 
acgtggatgc cgaggagagg gaggtggtgg tggaggagga ggacagatgg gcatgtgcag 1140 

gtcatgtggc cccctctgtg ggcaccctct gcagccacgg ccacctgcac tctggggcct 1200 

caggtcccaa ggctgaggct ctgctgcagg agggtgagct gctgctatca ccccacatgc 1260 
agaaggcact ggaaggtgtg cactacattg ccgaccacct gcggtctgag gatgctgact 1320 

cttcggtgaa ggaggactgg aagtatgttg ccatggtcat cgacaggatc ttcctctggc 1380 

tgtttatcat cgtctgcttc ctggggacca tcggcctctt tctgcctccg ttcctagctg 1440 

gaatgatctg actgcacctc cctcgagctg gctcccaggg caaaggggag ggttcttgga 1500 

tgtggaaggg ctttgaacaa tgtttagatt tggagatgag cccaaagtgc cagggagaac 1560
agccaggtga ggtgggaggt tggagagcca ggtgaggtct ctctaagtca ggctggggtt 1620
gaa 1623 

<210> 62 
<211> 1802 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509132CB1 

<400> 62 

gaatcagctg tgacgagccg catcagctta tactggcgca gtgtgctgga atgcgtcctt 60 

caggatcggt tctttcatct tcgccgcccc tgcgcgtcca gctcttctaa gacgagatgc 120 

cgtcgggctt ccaacagata ggctccgaag atggggaacc ccctcagcag cgagtgactg 180 

ggaccctggt ccttgctgtg ttctctgcgg tgcttggctc cctgcagttt gggtacaaca 240 

ttggggtcat caatgcccct cagaaggtga ttgaacagag ctacaatgag acgtggctgg 300 

ggaggcaggg gcctgaggga cccagctcca tccctccagg caccctcacc accctctggg 360 

ccctctccgt ggccatcttt tccgtgggcg gcatgatttc ctccttcctc attggtatca 420 

tctctcagtg gcttggaagg aaaagggcca tgctggtcaa caatgtcctg gcggtgctgg 480 

ggggcagcct catgggcctg gccaatgctg ctgcctccta tgaaatgctc atccttggac 540 

gattcctcat tggcgcctac tcaggtgctg ggcttggagt ccctcctggg cactgccagc 600 

ctgtggccac tgctcctggg cctcacagtg ctacctgccc tcctgcagct ggtcctgctg 660 

cccttctgtc ccgagagccc ccgctacctc tacatcatcc agaatctcga ggggcctgcc 720 

agaaagagtc tgaagcgcct gacaggctgg gccgatgttt ctggagtgct ggctgagctg 780 

aaggatgaga agcggaagct ggagcgtgag cggccactgt ccctgctcca gctcctgggc 840 

agccgtaccc accggcagcc cctgatcatt gcggtcgtgc tgcagctgag ccagcagctc 900 

tctggcatca atgctgtttt ctattattcg accagcatct tcgagacagc aggggtaggc 960 

cagcctgcct atgccaccat aggagctggt gtggtcaaca cagtcttcac cttggtctcg 1020 

gtgttgttgg tggagcgggc ggggcgccgg acgctccatc tcctgggcct ggcgggcatg 1080 

tgtggctgtg ccatcctgat gactgtggct ctgctcctgc tggagcgagt tccagccatg 1140 

agctacgtct ccattgtggc catctttggc ttcgtggcat tttttgagat tggccctggc 1200 

cccattcctt ggttcatcgt ggccgagctc ttcagccagg gaccccgccc ggcagccatg 1260 

gctgtggctg gtttctccaa ctggacgagc aacttcatca ttggcatggg tttccagtat 1320 

gttgcggagg ctatggggcc ctacgtcttc cttctatttg cggtcctcct gctgggcttc 1380 

ttcatcttca ccttcttaag agtacctgaa actcgaggcc ggacgtttga ccagatctca 1440 

gctgccttcc accggacacc ctctctttta gagcaggagg tgaaacccag cacagaactt 1500 

gagtatttag ggccagatga gaacgactga ggggccaggc aggggtggga gagccagctc 1560 

tctctacccg gcccagagac cccttccttt cctctgcagc actttaaccc tctcttccct 1620 

attatttccg ggtggaaaag aatccctgca gcctggtaga attgggaagc tgggggaagg 1680 

gtggtctgag caccccctca ttcccctcgt gtgactctct tggattattt atgtgttgtg 1740 

gtttggccgt ggccatcagg gtgggccact ctcccctccc tcttccttcc cccatcccct 1800 

tt 1802 

<210> 63 
<211> 2139 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509136CB1 

<400> 63 

agcctttggt gaagattgga tcgaaatcag accaatggac aagctctggc cgtgggtggt 60
ggacgggcct ggagtagaac tggaacgaga aggatgaaga gatgagcaca aaggtgtact 120
tagacctgga gtggactgac tacaggctga gctgggaccc tgcggagcac gacggcatcg 180 
attcgctccg catcacggcg gaatccgtgt ggctccctga cgtggtgcta ctgaacaaca 240 
atgatgggaa ttttgacgtg gctctggaca ttagcgtcgt ggtgtcctcc gacggctccg 300 
tgcgttggca acccccgggc atctatcgca gcagctgcag catccaggtc acctacttcc 360 
ccttcgactg gcagaattgc actatggtgt tcagctccta cagctacgac agctcggagg 420 
tcagcctgca gacaggcctg ggtcctgacg ggcaagggca tcaggaaatc cacattcatg 480 
aagggacttt cattgagaat ggccagtggg agattatcca caagccctct cggctaatcc 540 
agcctccagg cgatcctagg ggagggaggg aaggacagcg ccaggaagtc atcttctacc 600 
tcatcatccg ccgcaagcct ctcttctacc tggtcaacgt cattgcccca tgcatcctca 660 
tcactcttct ggccatcttc gtcttctacc tgccaccaga tgcagtcatc cttagtgtcg 720 
tggttctcaa cctgcaccac cgctcacccc acacccacca aatgcccctt tgggtccgtc 780 
agatcttcat tcacaaactt ccgctgtacc tgcgtctaaa aaggcccaaa cccgagagag 840 
acctgatgcc ggagccccct cactgttctt ctccaggaag tggctggggt cggggaacag 900 
atgaatattt catccggaag ccgccaagtg attttctctt ccccaaaccc aataggttcc 960 
agcctgaact gtctgcccct gatctgcggc gatttatcga tggtccaaac cgggctgtgg 1020 
ccctgcttcc ggagctacgg gaggtcgtct cctctatcag ctacatcgct cgacagctgc 1080 
aggaacagga ggaccacgat gcgctgaagg aggactggca gtttgtggcc atggtagtgg 1140 
accgcctctt cctgtggact ttcatcatct tcaccagcgt tgggacccta gtcatcttcc 1200 
tggacgccac gtaccacttg ccccctccag acccctttcc ttgaagactg gagggttgag 1260 
acccaggccc cctgccagtt gaagtgagag tttggtgata ctgtcaagcc ccatccttct 1320 
ctgcctctta actccttcac gaggaatctg ggcctcttat ttcgtttctg gggactgcat 1380 
tggactgagg gctgggtagg caggtgtctt ggacccacct gaaatgcagt atcatctgat 1440 
ttactctttg ggatcttgaa gaagctcttt tgggtatcaa cacctaggtc gccagtgaaa 1500 
tagaacacag aacaggaact agattataag ccttatgagg tcaagaaatg tgacttggcc 1560 
gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccaaggcg ggcggatcac 1620 
ctgaggtcgg gagtttgaga ccagcccgac caacatggag aaaccctgtc tctactaaaa 1680 
atacaaaatt agccaggtgt ggtggtacat gcctgtaatc ccagctacta gggaggctga 1740 
ggcaggagaa tcactcgaac ccgggaggca gaggttgcag tgagtcaaga tcacgccatt 1800 
gcactccagc ctgggcaaca agagcgaaac tccatctcaa aaaaaaaaaa gtgtcttgtt 1860 
cactgcagca tgcccagtgc cacgcacagg tgctgaccta tagtaagtgc ttaacatttg 1920 
ttgaataggg gaaagaaatt tccggaagta aatacagcaa ttaataatgt ttataagctg 1980 
ggcatggtcg ctcaggccag taatcccagt gacttaggag gctgaggtgg gaggattact 2040 
tgaaacccac agtttgagac cagcctgggc aacatagtga gaccctgtct ctaaaaaaat 2100 
aaaaatagaa ataaagtagt gcttattgtt tgcaaaaaa 2139 

<210> 64 
<211> 1461 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509178CB1 

<400> 64 

cacaagctcc ggtagcccat ggagccctgg cctctcctcc tgctctttag cctttgctca 60 
gctggcctcg tcctgggctc cgaacatgag acccgtctgg tggcaaagct atttaaagac 120 
tacagcagcg tggtgcggcc agtggaagac caccgccagg tcgtggaggt caccgtgggc 180 
ctgcagctga tacagctcat caatgtggat gaagtaaatc agatcgtgac aaccaatgtg 240 
cgtctgaaac agaactgcag catgaagctg ggcacctgga cctacgacgg ctctgtcgtg 300 
gccatcaacc cggaaagcga ccagccagac ctgagcaact tcatggagag cggggagtgg 360
gtgatcaagg agtcccgggg ctggaagcac tccgtgacct attcctgctg ccccgacacc 420 
ccctacctgg acatcaccta ccacttcgtc atgcagcgcc tgcccctcta cttcatcgtc 480 
aacgtcatca tcccctgcct gctcttctcc ttcttaactg gcctggtatt ctacctgccc 540 
acagactcag gggagaagat gactctgagc atctctgtct tactgtcttt gactgtgttc 600
cttctggtca tcgtggagct gatcccctcc acgtccagtg ctgtgccctt gattggaaaa 660
tacatgctgt tcaccatggt gttcgtcatt gcctccatca tcatcactgt catcgtcatc 720
aacacacacc accgctcacc cagcacccat gtcatgccca actgggtgcg gaaggttttt 780
atcgacacta tcccaaatat catgtttttc tccacaatga aaagaccatc cagagaaaag 840
caagacaaaa agatttttac agaagacatt gatatctctg acatttctgg aaagccaggg 900
cctccaccca tgggcttcca ctctcccctg atcaaacacc ccgaggtgaa aagtgccatc 960
gagggcatca agtacatcgc agagaccatg aagtcagacc aggagtctaa caatgcggcg 1020
gcagagtgga agtacgttgc aatggtgatg gaccacatac tcctcggagt cttcatgctt 1080
gtttgcatca tcggaaccct agccgtgttt gcaggtcgac tcattgaatt aaatcagcaa 1140
ggatgagcag aaaatgagct gagcttagct ctgccctgga acctaccaga gcagagaagg 1200
gcaggagagg aagatttgtc tacttgctcc actcgcactt atcaaacgtg ttatattcca 1260
tacttattat tgatgataag atttaccttt atgtaagttt atggccttga agtgttttca 1320
tattgcttct ccctttagtt ctgctgtctc cctgaagagt gaaccctctt tagtaaatga 1380
aactaatcac taaaaaaagt gttcatttcc agtgtctgga agagtttttg ccaggataac 1440
cgaggtttct gttgcattgc a 1461

<210> 65
<211> 738
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7509214CB1

<400> 65

gacagcggcc tggctaactc ctgccaggca gtgcccttcc cggagcgtgc cctcgccgct 60
gagctcccct gaacagcagc tgcagcagcc atggccccgc cctgggtgcc cgccatgggc 120
ttcacgctgg cgcccagcca tggggtacgg ctcctacctg gtctggaaag agctgggagg 180
cttcacagag aaggctgtgg ttcccctggg cctctacact gggcagctgg ccctgaactg 240
ggcatggccc cccatcttct ttggtgcccg acaaatgggc tgggccttgg tggatctcct 300
gctggtcagt ggggcggcgg cagccactac cgtggcctgg taccaggtga gcccgctggc 360
cgcccgcctg ctctacccct acctggcctg gctggccttc gcgaccacac tcaactactg 420
cgtatggcgg gacaaccatg gctggcgtgg gggacggcgg ctgccagagt gagtgcccgg 480
cccaccaggg actgcagctg caccagcagg tgccatcacg cttgtgatgt ggtggccgtc 540
acgctttcat gaccactggg cctgctagtc tgtcagggcc ttggcccagg ggtcagcaga 600
gcttcagagg tggccccacc tgagccccca cccgggagca gtgtcctgtg ctttctgcat 660
gcttagagca tgttcttgga acatggaatt ttataagctg aataaagttt ttgacttcct 720
ttaaaaaaaa aaaaaaaa 738

<210> 66
<211> 2106
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7509244CB1

<400> 66

agcctttggt gaagattgga tcgaaatcag accaatggac aagctctggc cgtgggtggt 60
ggacgggcct ggagtagaac tggaacgaga aggatgaaga gatgagcaca aaggtgtact 120
tagacctgga gtggactgac tacaggctga gctgggaccc tgcggagcac gacggcatcg 180
attcgctccg catcacggcg gaatccgtgt ggctccctga cgtggtgcta ctgaacaaca 240
atgatgggaa ttttgacgtg gctctggaca ttagcgtcgt ggtgtcctcc gacggctccg 300
tgcgttggca acccccgggc atctatcgca gcagctgcag catccaggtc acctacttcc 360
ccttcgactg gcagaattgc actatggtgt tcagctccta cagctacgac agctcggagg 420
tcagcctgca gacaggcctg ggtcctgacg ggcaagggca tcaggaaatc cacattcatg 480
aagggacttt cattgagaat ggccagtggg agattatcca caagccctct cggctaatcc 540
agcctccagg cgatcctagg ggagggaggg aaggacagcg ccaggaagtc atcttctacc 600
tcatcatccg ccgcaagcct ctcttctacc tggtcaacgt cattgcccca tgcatcctca 660
tcactcttct ggccatcttc gtcttctacc tgccaccaga tgcaggagag aagatggggc 720
tctcaatctt tgccctgctg acccttactg tgttcctgct gctgctggct gacaaagtac 780
ctgagacctc actatcagta cccattatta tcaagtacct catgtttacc atggtcctcg 840
tcaccttctc agtcatcctt agtgtcgtgg ttctcaacct gcaccaccgc tcaccccaca 900
cccaccaaat gcccctttgg gtccgtcaga tcttcattca caaacttccg ctgtacctgc 960
gtctaaaaag gcccaaaccc gagagagacc tgatgccgga gctacgggag gtcgtctcct 1020
ctatcagcta catcgctcga cagctgcagg aacaggagga ccacgatgcg ctgaaggagg 1080
actggcagtt tgtggccatg gtagtggacc gcctcttcct gtggactttc atcatcttca 1140
ccagcgttgg gaccctagtc atcttcctgg acgccacgta ccacttgccc cctccagacc 1200
cctttccttg aagactggag ggttgagacc caggccccct gccagttgaa gtgagagttt 1260
ggtgatactg tcaagcccca tccttctctg cctcttaact ccttcacgag gaatctgggc 1320
ctcttatttc gtttctgggg actgcattgg actgagggct gggtaggcag gtgtcttgga 1380
cccacctgaa atgcagtatc atctgattta ctctttggga tcttgaagaa gctcttttgg 1440
gtatcaacac ctaggtcgcc agtgaaatag aacacagaac aggaactaga ttataagcct 1500
tatgaggtca agaaatgtga cttggccggg cgcggtggct cacgcctgta atcccagcac 1560
tttgggaggc caaggcgggc ggatcacctg aggtcgggag tttgagacca gcccgaccaa 1620
catggagaaa ccctgtctct actaaaaata caaaattagc caggtgtggt ggtacatgcc 1680
tgtaatccca gctactaggg aggctgaggc aggagaatca ctcgaacccg ggaggcagag 1740
gttgcagtga gtcaagatca cgccattgca ctccagcctg ggcaacaaga gcgaaactcc 1800
atctcaaaaa aaaaaaagtg tcttgttcac tgcagcatgc ccagtgccac gcacaggtgc 1860
tgacctatag taagtgctta acatttgttg aataggggaa agaaatttcc ggaagtaaat 1920
acagcaatta ataatgttta taagctgggc atggtcgctc aggccagtaa tcccagtgac 1980
ttaggaggct gaggtgggag gattacttga aacccacagt ttgagaccag cctgggcaac 2040
atagtgagac cctgtctcta aaaaaataaa aatagaaata aagtagtgct tattgtttgc 2100
aaaaaa 2106

WO 03/083085 PCT/US03/09797



<210> 67 

<211> 2334 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509256CB1

<400> 67

ctcgagccgc tcgagccgcg ggcggctggg acgcagctgc ccccgctcgg cgcccgctgc 60
acccttagca gccactgcca cctgggcccg gagcctccac gatctcgccc ggcgattgtg 120
ggcaggggcg cctccggatc gatcttctga aattcaagtt ttcaagatga agtttttatt 180
gacaactgcc tttttaattt taatttcctt gtgggtggaa gaagcctatt ctaaggaaaa 240
gtcttcaaag aaagggaagg ggaaaaagaa gcagtatcta tgcccatctc agcagtcagc 300
agaggacctt gcccgagtac ctgccaactc cactagcaat atcttgaaca ggttattggt 360
cagttatgat cccaggataa gaccaaactt caaaggcatt cctgttgatg tagtagtcaa 420
catttttatt aacagttttg gatccattca agaaacaaca atggactata gagttaacat 480
cttcctgaga caaaaatgga atgaccccag gctgaagctc cccagtgatt ttaggggttc 540
agatgcactg acagtggatc caacaatgta caagtgttta tggaaacctg atttattttt 600
tgcaaatgaa aaaagtgcca attttcatga tgtgacccag gaaaacatcc tcctctttat 660
ttttcgtgat ggagatgtcc ttgtcagcat gaggttatct attactcttt catgcccttt 720
ggacttgaca ttgtttccca tggatacaca acgttgcaag atgcaactgg agagctttgg 780
ttacacaact gatgatttac gatttatctg gcagtcagga gatcctgtgc aattagaaaa 840
aattgccttg cctcaatttg atatcaaaaa ggaagatatt gaatatggta actgtacaaa 900
atactataaa ggcacgggct actacacatg cgtggaagtc atcttcaccc tgaggaggca 960
ggtcggcttt tacatgatgg gggtctacgc cccaaccctg ctcattgttg ttctctcctg 1020 
gctttccttc tggatcaacc cggacgcgag tgctgccaga gtgcccctgg gttggtgaga 1080 
ccagatgcaa aaaagtttgt acttctaagt ctgatctgag atctaatgac ttcagcattg 1140 
ttggaagctt accaagagat tttgaactat ccaattatga ctgctatgga aaacccattg 1200 
aagttaacaa cggacttggg aaatctcagg ctaagaacaa caagaagcct ccccctgcga 1260 
aacctgttat tccaacagca gcaaagcgaa ttgatcttta tgcaagagca ttgtttcctt 1320 
tctgcttctt gttcttcaat gttatatatt ggtctatata tttatgataa atcttttcca 1380 
tttgtacaaa ataaaattcc atttcattgt gacctactcc tttcataaat gccaatctgt 1440 
gagaactttt gaattttcat agcaacattg cattttggat gccatttgat tgtaataaaa 1500 
ctgtggcacc ttaattttga atggcagcat gatcatgtaa tatctgtgct ctaataacga 1560 
tgtatatatg tatagtgaac atattgctta gtaacaaatg aaggacaagc atactacata 1620 
atataatcca tacaattctc ttcagttagt gtaaactgca aatactacag ataattctga 1680 
taataaaatg atatgcacgc tgaatcctgc tatggtcacc attctaatgt atgtagtatt 1740 
tcaaatttcc ttccttgtaa ctttcaaaga aagccatctt attcttgtaa aattttagat 1800 
ggtattatca cagatttaaa aaggttgtat tacatattgt ttaaactttg taagtagaaa 1860 
tatatctgtt ataattatac aggctctgtg gagaaataaa gttcaaaaaa tattaatttg 1920 
taaaatcagc tcgttttaaa gtgtgcttgt gttgtcaaaa atatcagata gtaatacaca 1980 
gtgagcattt ttaaacaaag ggaaacctat atttatgtaa ctgtatactg aattctgaca 2040 
aaataaaaaa agatacctta ttgacgaaat atttaggata aacaaaattc tatttaatcc 2100 
accttaaaac ctaaatgtat tttcatggat ttcatttgtt ggtacatatt acacaaaaca 2160 
ttgtgcctta aaatgagtca tacatctttt aaattggaat gcagtaatag atatgtgatt 2220 
ttacatcatt tttaagaaac caaggggaag taataagttg aaaaagaaat ccataactat 2280 
taaaagattt taactttttt attttattaa aatgcttgca tattttaagt aaaa 2334 

<210> 68 
<211> 1475 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509395CB1 

<400> 68 

ccacggccag tgtgctcgaa tcggtacttc acaagctccg gtagcccatg gagccctggc 60 
ctctcctcct gctctttagc ctttgctcag ctggcctcgt cctgggctcc gaacatgaga 120 
cccgtctggt ggcaaagcta tttaaagact acagcagcgt ggtgcggcca gtggaagacc 180 
accgccaggt cgtggaggtc accgtgggcc tgcagctgat acagctcatc aatgtggatg 240 
aagtaaatca gatcgtgaca accaataact gcagcatgaa gctgggcacc tggacctacg 300 
acggctctgt cgtggccatc aacccggaaa gcgaccagcc agacctgagc aacttcatgg 360 
agagcgggga gtgggtgatc aaggagtccc ggggctggaa gcactccgtg acctattcct 420 
gctgccccga caccccctac ctggacatca cctaccactt cgtcatgcag cgcctgcccc 480 
tctacttcat cgtcaacgtc atcatcccct gcctgctctt ctccttctta actggcctgg 540 
tattctacct gcccacagac tcaggggaga agatgactct gagcatctct gtcttactgt 600 
ctttgactgt gttccttctg gtcatcgtgg agctgatccc ctccacgtcc agtgctgtgc 660 
ccttgattgg aaaatacatg ctgttcacca tggtgttcgt cattgcctcc atcatcatca 720 
ctgtcatcgt catcaacaca caccaccgct cacccagcac ccatgtcatg cccaactggg 780 
tgcggaaggt ttttatcgac actatcccaa atatcatgtt tttctccaca atgaaaagac 840 
catccagaga aaagcaagac aaaaagattt ttacagaaga cattgatatc tctgacattt 900 
ctggaaagcc agggcctcca cccatgggct tccactctcc cctgatcaaa caccccgagg 960 
tgaaaagtgc catcgagggc atcaagtaca tcgcagagac catgaagtca gaccaggagt 1020 
ctaacaatgc ggcggcagag tggaagtacg ttgcaatggt gatggaccac atactcctcg 1080 
gagtcttcat gcttgtttgc atcatcggaa ccctagccgt gtttgcaggt cgactcattg 1140 
aattaaatca gcaaggatga gcagaaaatg agctgagctt agctctgccc tggaacctac 1200 
cagagcagag aagggcagga gaggaagatt tgtctacttg ctccactcgc acttatcaaa 1260 



cgtgttatat tccatactta ttattgatga taagatttac ctttatgtaa gtttatggcc 1320 

ttgaagtgtt ttcatattgc ttctcccttt agttctgctg tctccctgaa gagtgaaccc 1380 

tctttagtaa atgaaactaa tcactaaaaa aagtgttcat ttccagtgtc tggaagagtt 1440 
tttgccagga taaccgaggt ttctgttgca ttgca 1475 

<210> 69 

<211> 1295 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7503287CB1 

<400> 69 

ccaggacccg ccgccggctg ccggcttgcc gaagccccct caggatcccc tcaacaagga 60 

tggaactgaa ggccgaggag gaggaggtgg gtggcgtcca gccggtgagc atccaggcct 120 

tcgccagcag ctccacactg cacggcctgg cccacatctt ctcctacgag cggctgtctc 180 

tgaagcgggc actgtgggcc ctgtgcttcc tgggctcgct ggctgtgctg ctgtgtgtgt 240 

gcacggagcg tgtgcagtac tacttccact accaccatgt caccaagctc gacgaggtgg 300 

ctgcctctca gcttaccttc cctgctgtca cgctgtgcaa cctcaacgag ttccgcttta 360 

gccaagtctc caagaatgac ctgtatcatg ctggggagct gctggccctg ctcaacaaca 420 

ggtatgagat accagacaca cagatggcag atgaaaagca gctggagata ctgcaggaca 480 

aagccaactt ccgcagcttc aaacccaaac ccttcaacat gcgtgagttc tacgaccgag 540 

ctgggcacga cattcgagac atgctgctct cctgccactt ccggggggag gtctgcagcg 600 

ctgaagactt caaggtggtc ttcacacgct atggaaagtg ctacacgttc aactcgggcc 660 

gagatgggcg gccgcggctg aagaccatga agggtgggac gggcaatggg ctggaaatca 720 

tgctggacat ccagcaggac gagtacctgc ctgtgtgggg ggagactgac gagacgtcct 780 
tcgaagcagg catcaaagtg cagatctttc ctcttgtctg tggtaaggaa ggagtcttga 840 

ccatagagtc ctctctctgc ctctatccca ttctttttac atttaacaaa actaatctaa 900 

aaaagaacta aaaagggaga acggggcaag ggacctcagg ctgcccctct ctcctccatg 960 
ctgcctcccc tagctcccag cctgaattct gtctatctag ctgtctgcca tctgagtgtc 1020 

catctacatt ctgctgccac cagtcaccaa aggcccttcc cagtgagggg tggaagggat 1080 

ctctggggtc tggaatttgg ccccaaacca gagaatgtac cttaaggggg agggctagtg 1140 

tgggggaggg aggcttcccc agccttaaga gaccctctca gcccagtgac tgtccccaaa 1200 

cccaagtctc ctggcaggaa ctaaaacctc agccccactc tctcacacca tgtggaatct 1260 
cgtgggggtc ggggatcccc ttaagaagtg gtaat 1295 



<210> 70 

<211> 1386 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7503320CB1 

<400> 70 

tgcagctccg ggactcaaca tgcgctgctc gccgggaggc gtctggctgg cgctggccgc 60
gtcgctcctg cacgtgtccc tgcaaggcga gttccagagg aagctttaca aggagctggt 120
caagaactac aatcccttgg agaggcccgt ggccaatgac tcgcaaccac tcaccgtcta 180
cttctccctg agcctcctgc agatcatgga cgtggatgag aagaaccaag ttttaaccac 240
cacgaccccg acgggggcaa gatgcccaag tggaccagag tcatccttct gaactggtgc 300
gcgtggttcc tgcgaatgaa gaggcccggg gaggacaagg tgcgcccggc ctgccagcac 360
aagcagcggc gctgcagcct ggccagtgtg gagatgagcg ccgtggcgcc gccgcccgcc 420
agcaacggga acctgctgta catcggcttc cgcggcctgg acggcgtgca ctgtgtcccg 480
acccccgact ctggggtagt gtgtggccgc atggcctgct cccccacgca cgatgagcac 540
ctcctgcacg gcgggcaacc ccccgagggg gacccggact tggccaagat cctggaggag 600
gtccgctaca ttgccaaccg cttccgctgc caggacgaaa gcgaggcggt ctgcagcgag 660
tggaagttcg ccgcctgtgt ggtggaccgc ctgtgcctca tggccttctc ggtcttcacc 720
atcatctgca ccatcggcat cctgatgtcg gctcccaact tcgtggaggc cgtgtccaaa 780
gactttgcgt aaccacgcct ggttctgtac atgtggaaaa ctcacagatg ggcaaggcct 840
ttggcttggc gagatttggg ggtgctaatc caggacagca ttacacgcca caactccagt 900
gttcccttct ggctgtcagt cgtgttgctt acggtttctt tgttacttta ggtagtagaa 960
tctcagcact ttgtttcata ttctcagatg ggctgataga tatccttggc acatccgtac 1020
catcggtcag cagggccact gagtagtcat tttgcccatt agcccactgc ctggaaagcc 1080
cttcggagag ctccccatgg ctcctcacca ccgagacagt tggttttgca tgtctgcatg 1140
aaggtctacc tgaaaattca acatttgctt tttgcttgtg tacaaaccca gattgaagct 1200
aaaataaacc agactcacta aatcctttcc aataattgac tggtggaagg aaaacaaaaa 1260
acaaaaacta aaaacctctt agcttttctg caattcaact ttttattttt atttttattt 1320
ctatcaaaga cggtagagag aaacagcttg atgctgtttc tacattaaaa aaaaaattgg 1380
tcggtc 1386

<210> 71

<211> 2213

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 7503335CB1 



<400> 71 

gttactggga gggggcttgc tgtggccctg tcaggaagag tagagctctg gtccagctcc 60
gcgcagggag ggaggctgtc accatgccgg cctgctgcag ctgcagtgat gttttccagt 120
atgagacgaa caaagtcact cggatccaga gcatgaatta tggcaccatt aagtggttct 180
tccacgtgat catcttttcc tacgtttgct ttgctctggt gagtgacaag ctgtaccagc 240
ggaaagagcc tgtcatcagt tctgtgcaca ccaaggtgaa ggggatagca gaggtgaaag 300
aggagatcgt ggagaatgga gtgaagaagt tggtgcacag tgtctttgac accgcagact 360
acaccttccc tttgcagggg aactctttct tcgtgatgac aaactttctc aaaacagaag 420
gccaagagca gcggttgtgt cccgagtatc ccacccgcag gacgctctgt tcctctgacc 480
gaggttgtaa aaagggatgg atggacccgc agagcaaagg aattcagacc ggaaggtgtg 540
tagtgcatga agggaaccag aagacctgtg aagtctctgc ctggtgcccc atcgaggcag 600
tggaagaggc cccccggcct gctctcttga acagtgccga aaacttcact gtgctcatca 660
agaacaatat cgacttcccc ggccacaact acaccacgag aaacatcctg ccaggtttaa 720
acatcacttg taccttccac aagactcaga atccacagtg tcccattttc cgactaggag 780
acatcttccg agaacaggcg ataatttttc agatgtggca attcagatac gccaagtact 840
acaaggaaaa caatgttgag aaacggactc tgataaaagt cttcgggatc cgttttgaca 900
tcctggtttt tggcaccgga ggaaaatttg acattatcca gctggttgtg tacatcggct 960
caaccctctc ctacttcggt ctggccgctg tgttcatcga cttcctcatc gacacttact 1020
ccagtaactg ctgtcgctcc catatttatc cctggtgcaa gtgctgtcag ccctgtgtgg 1080
tcaacgaata ctactacagg aagaagtgcg agtccattgt ggagccaaag ccgacattaa 1140
agtatgtgtc ctttgtggat gaatcccaca ttaggatggt gaaccagcag ctactaggga 1200
gaagtctgca agatgtcaag ggccaagaag tcccaagacc tgcgatggac ttcacagatt 1260
tgtccaggct gcccctggcc ctccatgaca cacccccgat tcctggacaa ccagaggaga 1320
tacagctgct tagaaaggag gcgactccta gatccaggga tagccccgtc tggtgccagt 1380
gtggaagctg cctcccatct caactccctg agagccacag gtgcctggag gagctgtgct 1440
gccggaaaaa gccgggggcc tgcatcacca cctcagagct gttcaggaag ctggtcctgt 1500
ccagacacgt cctgcagttc ctcctgctct accaggagcc cttgctggcg ctggatgtgg 1560
attccaccaa cagccggctg cggcactgtg cctacaggtg ctacgccacc tggcgcttcg 1620
gctcccagga catggctgac tttgccatcc tgcccagctg ctgccgctgg aggatccgga 1680
aagagtttcc aaagagtgaa gggcagtaca gtggcttcaa gagtccttac tgaagccagg 1740
caccgtggct cacgtctgta atcccagcgc tttgggaggc cgaggcaggc agatcacctg 1800

aggtcgggag ttggagaccc gcctggctaa caaggcgaaa tcctgtctgt actaaaaata 1860 

caaaaatcag ccagacatgg tggcatgcac ctgcaatccc agctactcgg gaggctgagg 1920 

cacaagaatc acttgaaccc gggaggcaga ggttgtagtg agcccagatt gtgccactgc 1980 

tctccagcct gggaggcaca gcaaactgtc ccaaaaaaaa caaaagagtc cttaccaata 2040 

gccgggggct tgcagtaccc atgttaactt gaccatttac caagcaattg aacttcacct 2100 

gcaaagctct gtggccactt ttccagccaa agggaaatat gcttcacctc ggtggccctc 2160 

ggggtctgaa gcaaaggggc ctggttaaac aaccaaaaat ccttaaatgg agc 2213

<210> 72 

<211> 1289 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7503952CB1 

<400> 72 

gagaaattga gcggcattcc atctggtagg caagtttgca tttctccttt ttgggatctg 60

cccaggaatg ttgtcaagtg taatggctcc cctgtgggcc tgcatcctgg tggctgcagg 120

aattctagcc acagatacac atcatcccca ggattctgct ctgtatcatc tcagcaagca 180

gctattacag aaatatcata aagaagtgag acctgtttac aactggacca aggccaccac 240 

agtctacctg gacctgttcg tccatgctat attggatgtg gatgcagaga atcaaatatt 300

aaagacaagt gtatggtacc aagaggtctg gaatgatgaa tttttatcct ggaactccag 360 

catgtttgat gagattagag agatctccct acctctaagt gccatctggg cccccgatat 420 

catcatcaat gagtttgtgg acattgaaag ataccctgac cttccctatg tttatgtgaa 480 

ctcatctggg accattgaga actataagcc catccaggtg gtctctgcgt gcagtttaga 540

gacatatgct tttccatttg atgtccagaa ttgcagcctg accttcaaga gcattctgca 600

tacagtggaa gacgtagacc tggcctttct gaggagccca gaagacattc agcatgacaa 660

aaaggcgttt ttgaatgaca gtgagtggga acttctatct gtgtcctcca catacagcat 720

cctgcagagc agcgctggag gatttgcaca gattcagttt aatggcactt cttcaccatc 780

tgcatggcct tcttggttct cagcttagct aagtccatcg tgttggtcaa attcctccat 840

gatgagcagc gtggtggaca ggagcagccc ttcttgtgcc ttcgagggga caccgatgct 900 

gacaggccta gagtggaacc cagggcccaa cgtgctgtgg taacagagtc ctcgctgtat 960

ggagagcacc tggcccagcc aggaaccctg aaggaagtct ggtcgcagct tcaatctatc 1020

agcaactacc tccaaactca ggaccagaca gaccaacagg aggcagagtg gctggtcctc 1080 

ctgtcccgct ttgaccgact gctcttccaa agctaccttt tcatgctggg gatctacacc 1140 

atcactctgt gctccctctg ggcactgtgg ggcggcgtgt gaagactgaa gtgttcttca 1200 

gtaattgtgc tggcacttag gagagagagg agggggaata atagtgggtt aaaaagcttt 1260 

ctgggtcggg tgtggtggtt cttgcctat 1289

<210> 73 

<211> 1358 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504530CB1 

<400> 73 

cgctcgaggc cgccctggga gagctgagag gcaggctctg gactggggac acagggatag 60 
ctgagcccca gctgggggtg gaagctgagc cagggacagt cacggaggaa caagatcaag 120
atgcgctgta actgagaagc ccccaaggcg gaggctgaga atcagagaca tttcagcaga 180 
catctacaaa tctgaaagac aaaacatggt tcaagcatcc gggcacaggc ggtccacccg 240 
tggctccaaa atggtctcct ggtccgtgat agcaaagatc caggaaatac tgcagaggaa 300
gatggtgcga gagttcctgg ccgagttcat gagcacatat gtcatgatgg tattcggcct 360
tggttccgtg gcccatatgg ttctaaataa aaaatatggg agctaccttg gtgtcaactt 420
gggttttggc ttcggagtca ccatgggagt gcacgtggca ggccgcatct ctggagccca 480
catgaacgca gctgtgacct ttgctaactg tgcgctgggc cgcgtgccct ggaggaagtt 540
tccggtctat gtgctggggc agttcctggg ctccttcctg gcggctgcca ccatctacag 600
tctcttctac acggccattc tccacttttc gggtggacag ctgatggtga ccggtcccgt 660
cgctacagct ggcatttttg ccacctacct tcctgatcac atgacattgt ggcggggctt 720
cctgaatgag gcgtggctga ccgggatgct ccagctgtgt ctcttcgcca tcacggacca 780
ggagaacaac ccagcactgc caggaacaga ggcgctggtg ataggcatcc tcgtggtcat 840
catcggggtg tcccttggca tgaacacagg atatgccatc aacccgtccc gggacctgcc 900
cccccgcatc ttcaccttca ttgctggttg gggcaaacag gtcttcaggt ggcatcatct 960
acctggtctt cattggctcc accatcccac gggagcccct gaaattggag gattctgtgg 1020
cgtatgaaga ccacgggata accgtattgc ccaagatggg atctcatgaa cccacgatct 1080
ctcccctcac ccccgtctct gtgagccctg ccaacagatc ttcagtccac cctgccccac 1140
ccttacatga atccatggcc ctagagcact tctaagcaga gattatttgt gatcccatcc 1200
attccccaat aaagcaaggc ttgtccgaca gcagtacccc cacttcctgg gggcctcctg 1260
tggttgggct tccctcctgg gttcttacag gagctccagg gctatgtctt agcccaaggg 1320
gtagaggtga ggcacctaag gccttccatc cccgggag 1358

<210> 74

<211> 2232

<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7509303CB1



<400> 74 

ggaccctaga cctctgcagc ccataccagg tctcatggag gggaacaagc tggaggagca 60
ggactctagc cctccacagt ccactccagg gctcatgaag gggaacaagc gtgaggagca 120
ggggctgggc cccgaacctg cggcgcccca gcagcccacg gcggaggagg aggccctgat 180
cgagttccac cgctcctacc gagagctctt cgagttcttc tgcaacaaca ccaccatcca 240
cggcgccatc cgcctggtgt gctcccagca caaccgcatg aagacggcct tctgggcagt 300
gctgtggctc tgcacctttg gcatgatgta ctggcaattc ggcctgcttt tcggagagta 360
cttcagctac cccgtcagcc tcaacatcaa cctcaactcg gacaagctcg tcttccccgc 420
agtgaccatc tgcaccctca atccctacag gtacccggaa attaaagagg agctggagga 480
gctggaccgc atcacagagc agacgctctt tgacctgtac aaatacagct ccttcaccac 540
tctcgtggcc ggctcccgca gccgtcgcga cctgcggggg actctgccgc accccttgca 600
gcgcctgagg gtcccgcccc cgcctcacgg ggcccgtcga gcccgtagcg tggcctccag 660
cttgcgggac aacaaccccc aggtggactg gaaggactgg aagatcggct tccagctgga 720
attactctca cttccaccac ccgatgtatg gaaactgcta tactttcaat gacaagaaca 780
actccaacct ctggatgtct tccatgcctg gaatcaacaa cggtctgtcc ctgatgctgc 840
gcgcagagca gaatgacttc attcccctgc tgtccacagt gactggggcc cgggtaatgg 900
tgcacgggca ggatgaacct gcctttatgg atgatggtgg ctttaacttg cggcctggcg 960
tggagacctc catcagcatg aggaaggaaa ccctggacag acttgggggc gattatggcg 1020
actgcaccaa gaatggcagt gatgttcctg ttgagaacct ttacccttca aagtacacac 1080
agcaggtgtg tattcactcc tgcttccagg agagcatgat caaggagtgt ggctgtgcct 1140
acatcttcta tccgcggccc cagaacgtgg agtactgtga ctacagaaag cacagttcct 1200
gggggtactg ctactataag ctccaggttg acttctcctc agaccacctg ggctgtttca 1260
ccaagtgccg gaagccatgc agcgtgacca gctaccagct ctctgctggt tactcacgat 1320
ggccctcggt gacatcccag gaatgggtct tccagatgct atcgcgacag aacaattaca 1380
ccgtcaacaa caagagaaat ggagtggcca aagtcaacat cttcttcaag gagctgaact 1440
acaaaaccaa ttctgagtct ccctctgtca cgatggtcac cctcctgtcc aacctgggca 1500
gccagtggag cctgtggttc ggctcctcgg tgttgtctgt ggtggagatg gctgagctcg 1560
tctttgacct gctggtcatc atgttcctca tgctgctccg aaggttccga agccgatact 1620
ggtctccagg ccgagggggc aggggtgctc aggaggtagc ctccaccctg gcatcctccc 1680 
ctccttccca cttctgcccc caccccatgt ctctgtcctt gtcccagcca ggccctgctc 1740 
cctctccagc cttgacagcc cctccccctg cctatgccac cctgggcccc cgcccatctc 1800 
cagggggctc tgcaggggcc agttcctcca cctgtcctct gggggggccc tgagagggaa 1860 
ggagaggttt ctcacaccaa ggcagatgct cctctggtgg gagggtgctg gccctggcaa 1920 
gattgaagga tgtgcagggc ttcctctcag agccgcccaa actgccgttg atgtgtggag 1980 
gggaagcaag atgggtaagg gctcaggaag ttgctccaag aacagtagct gatgaagctg 2040 
cccagaagtg ccttggctcc agccctgtac cccttggtac tgcctctgaa cactctggtt 2100 
tccccaccca actgcggcta agtctctttt tcccttggat cagccaagcg aaacttggag 2160 
ctttgacaag gaactttcct aagaaaccgc tgataaccag gacaaaacac aaccaaggta 2220 
cacgcaggca tg 2232 

<210> 75 

<211> 2230 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509910CB1 



<400> 75 

gttactggga gggggcttgc tgtggccctg tcaggaagag tagagctctg gtccagctcc 60
gcgcagggag ggaggctgtc accatgccgg cctgctgcag ctgcagtgat gttttccagt 120
atgagacgaa caaagtcact cggatccaga gcatgaatta tggcaccatt aagtggttct 180
tccacgtgat catcttttcc tacgtttgct ttgctctggt gagtgacaag ctgtaccagc 240
ggaaagagcc tgtcatcagt tctgtgcaca ccaaggtgaa ggggatagca gaggtgaaag 300
aggagatcgt ggagaatgga gtgaagaagt tggtgcacag tgtctttgac accgcagact 360
acaccttccc tttgcagggg aactctttct tcgtgatgac aaactttctc aaaacagaag 420
gccaagagca gcggttgtgt cccgagtatc ccacccgcag gacgctctgt tcctctgacc 480
gaggttgtaa aaagggatgg atggacccgc agagcaaagg aattcagacc ggaaggtgtg 540
tagtgcatga agggaaccag aagacctgtg aagtctctgc ctggtgcccc atcgaggcag 600
tggaagaggc cccccggcct gctctcttga acagtgccga aaacttcact gtgctcatca 660
agaacaatat cgacttcccc ggccacaact acaccacgag aaacatcctg ccaggtttaa 720
acatcacttg taccttccac aagactcaga atccacagtg tcccattttc cgactaggag 780
acatcttccg agaaacaggc gataattttt cagatgtggc aattcagggc ggaataatgg 840
gcattgagat ctactgggac tgcaacctag accgttggtt ccatcactgc cgtcccaaat 900
acagtttccg tcgccttgac gacaagacca ccaacgtgtc cttgtaccct ggctacaact 960
tcagatacgc caagtactac aaggaaaaca atgttgagaa acggactctg ataaaagtct 1020
tcgggatccg ttttgacatc ctggtttttg gcaccggagg aaaatttgac attatccagc 1080
tggttgtgta catcggctca accctctcct acttcggtct ggtaagagat tctcttttcc 1140
atgctttagg aaaatggttt ggagaaggaa gtgactaacg cagcgcttgt ctgcattctc 1200
cccaggccgc tgtgttcatc gacttcctca tcgacactta ctccagtaac tgctgtcgct 1260
cccatattta tccctggtgc aagtgctgtc agccctgtgt ggtcaacgaa tactactaca 1320
ggaagaagtg cgagtccatt gtggagccaa agccgacatt aaagtatgtg tcctttgtgg 1380
atgaatccca cattaggatg gtgaaccagc agctactagg gagaagtctg caagatgtca 1440
agggccaaga agtcccaaga cctgcgatgg acttcacaga tttgtccagg ctgcccctgg 1500
ccctccatga cacacccccg attcctggac aaccagagga gatacagctg cttagaaagg 1560
aggcgactcc tagatccagg gatagccccg tctggtgcca gtgtggaagc tgcctcccat 1620
ctcaactccc tgagagccac aggtgcctgg aggagctgtg ctgccggaaa aagccggggg 1680
cctgcatcac cacctcagag ctgttcagga agctggtcct gtccagacac gtcctgcagt 1740
tcctcctgct ctaccaggag cccttgctgg cgctggatgt ggattccacc aacagccggc 1800
tgcggcactg tgcctacagg tgctacgcca cctggcgctt cggctcccag gacatggctg 1860
actttgccat cctgcccagc tgctgccgct ggaggatccg gaaagagttt ccgaagagtg 1920
aagggcagta cagtggcttc aagagtcctt actgaagcca ggcaccgtgg ctcacgtctg 1980
taatcccagc gctttgggag gccgaggcag gcagatcacc tgaggtcggg agttggagac 2040

ccgcctggct aacaaggcga aatcctgtct gtactaaaaa tacaaaaatc agccagacat 2100 

ggtggcatgc acctgcaatc ccagctactc gggaggctga ggcacaagaa tcacttgaac 2160 

ccgggaggca gaggttgtag tgagcccaga ttgtgccact gctctccagc ctgggaggca 2220 

cagcaaactg 2230 

<210> 76 

<211> 5966 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509982CB1 

<400> 76 

gcttggtctg gctacacccc ttttcagaaa ccaggctgtg taagagctgc tggagtaggc 60 
acccatttaa agaaaaaatg aagaagcagc aataaagaag ttgtaatcgt tacctagaca 120 
aacagagaac tggttttgac agtgtttcta gagtgctttt tattattttc ctgacagttg 180 
tgttccacca tgattacttt ctccttcagc gaataggcta aatgaatatg aaacagaaaa 240 
gcgtgtatca gcaaaccaaa gcacttctgt gcaagaattt tcttaagaaa tggaggatga 300 
aaagagagag cttattggaa tggggcctct caatacttct aggactgtgt attgctctgt 360 
tttccagttc catgagaaat gtccagtttc ctggaatggc tcctcagaat ctgggaaggg 420 
tagataaatt taatagctct tctttaatgg ttgtgtatac accaatatct aatttaaccc 480 
agcagataat gaataaaaca gcacttgctc ctcttttgaa aggaacaagt gtcattgggg 540 
caccaaataa aacacacatg gacgaaatac ttctggaaaa tttaccatat gctatgggaa 600 
tcatctttaa tgaaactttc tcttataagt taatattttt ccagggatat aacagtccac 660 
tttggaaaga agatttctca gctcattgct gggatggata tggtgagttt tcatgtacat 720 
tgaccaaata ctggaataga ggatttgtgg ctttacaaac agctattaat actgccatta 780 
tagaaatcac aaccaatcac cctgtgatgg aggagttgat gtcagttact gctataacta 840 
tgaagacatt acctttcata actaaaaatc ttcttcacaa tgagatgttt attttattct 900 
tcttgcttca tttctcccca cttgtatatt ttatatcact caatgtaaca aaagagagaa 960 
aaaagtctaa gaatttgatg aaaatgatgg gtctccaaga ttcagcattc tggctctcct 1020 
ggggtctaat ctatgctggc ttcatcttta ttatttccat attcattaca attatcataa 1080 
cattcaccca aattatagtc atgactggct tcatggtcat atttataccc ttttttttat 1140 
atggcttatc tttggtagct ttggtgttcc tgctgagtgt gctgttaaag aaagctgtcc 1200 
tcaccaattt ggttgtgttt ctccttaccc tcttttgggg atgtctggga ttcactgtat 1260 
tttatgaaca acttccttca tctctggagt ggattttgaa tatttgtagc ccttttgcct 1320 
ttactactgg aatgattcag attatcaaac tggattataa cttgaatggt gtaatttttc 1380 
ctgacccttc aggagactca tatacaatga tagcaacttt ttctatgttg cttttggatg 1440 
gtctcatcta cttgctattg gcattatact ttgacaaaat tttaccctat ggagatgagc 1500 
gccattattc tcctttattt ttcttgaatt catcatcttg tttccaacac caaaggacta 1560 
atgctaaggt tattgagaaa gaaatcgatg ctgagcatcc ctctgatgat tattttgaac 1620 
cagtagctcc tgaattccaa ggaaaagaag ccatcagaat cagaaatgtt aagaaggaat 1680 
ataaaggaaa atctggaaaa gtggaagcat tgaaaggctt gctctttgac atatatgaag 1740 
gtcaaatcac ggcaatcctg ggtcacagtg gagctggcaa atcttcactg ctaaatattc 1800 
ttaatggatt gtctgttcca acagaaggat cagttaccat ctataataaa aatctctctg 1860 
aaatgcaaga cttggaggaa atcagaaaga taactggcgt ctgtcctcaa ttcaatgttc 1920 
aatttgacat actcaccgtg aaggaaaacc tcagcctgtt tgctaaaata aaagggattc 1980 
atctaaagga agtggaacaa gaggtacaac gaatattatt ggaattggac atgcaaaaca 2040 
ttcaagataa ccttgctaaa catttaagtg aaggacagaa aagaaagctg acttttggga 2100 
ttaccatttt aggagatcct caaattttgc ttttagatga accaactact ggattggatc 2160 
ccttttccag agatcaagtg tggagcctcc tgagagagcg tagagcagat catgtgatcc 2220 
ttttcagtac ccagtccatg gatgaggctg acatcctggc tgatagaaaa gtgatcatgt 2280 
ccaatgggag actgaagtgt gcaggttctt ctatgttttt gaaaagaagg tggggtcttg 2340 
gatatcacct aagtttacat aggaatgaaa tatgtaaccc agaacaaata acatccttca 2400 
ttactcatca catccccgat gctaaattaa aaacagaaaa caaagaaaag cttgtatata 2460
ctttgccact ggaaaggaca aatacatttc cagatctttt cagtgatctg gataagtgtt 2520
ctgaccaggg agtgacaggt tatgacattt ccatgtcaac tctaaatgaa gtctttatga 2580
aactggaagg acagtcaact atcgaacaag gtaaagccat ttgtataaat ttcgaacaag 2640
tggagatgat aagagactca gaaagcctca atgaaatgga gctggctcac tcttccttct 2700
ctgaaatgca gacagctgtg agtgacatgg gcctctggag aatgcaagtc tttgccatgg 2760
cacggctccg tttcttaaag ttaaaacgtc aaactaaagt gttattgacc ctattattgg 2820
tatttggaat cgcaatattc cctttgattg ttgaaaatat aatatatgct atgttaaatg 2880
aaaagatcga ttgggaattt aaaaacgaat tgtattttct ctctcctgga caacttcccc 2940
aggaaccccg taccagcctg ttgatcatca ataacacaga atcaaatatt gaagatttta 3000
taaaatcact gaagcatcaa aatatacttt tggaagtaga tgactttgaa aacagaaatg 3060
gtactgatgg cctctcatac aatggagcta tcatagtttc tggtaaacaa aaggattata 3120
gattttcagt tgtgtgtaat accaagagat tgcactgttt tccaattctt atgaatatta 3180
tcagtaatgg gctacttcaa atgtttaatc acacacaaca tattcgaatt gagtcaagcc 3240
catttcctct tagccacata ggactctgga ctgggttgcc ggatggttcc tttttcttat 3300
ttttggttct atgtagcatt tctccttata tcaccatggg cagcatcagt gattacaaga 3360
aaaatgctaa gtcccagcta tggatttcag gcctctacac ttctgcttac tggtgtgggc 3420
aggcactagt ggacgtcagc ttcttcattt taattctcct tttaatgtat ttaattttct 3480
acatagaaaa catgcagtac cttcttatta caagccaaat tgtgtttgct ttggttatag 3540
ttactcctgg ttatgcagct tctcttgtct tcttcatata tatgatatca tttatttttc 3600
gcaaaaggag aaaaaacagt ggcctttggt cattttactt cttttttgcc tccaccatca 3660
tgttttccat cactttaatc aatcattttg acctaagtat attgattacc accatggtat 3720
tggttccttc atataccttg cttggattta aaactttttt ggaagtgaga gaccaggagc 3780
actacagaga atttccagag gcaaattttg aattgagtgc cactgatttt ctagtctgct 3840
tcatacccta ctttcagact ttgctattcg tttttgttct aagatacatg gaactaaaat 3900
gtggaaagaa aagaatgcga aaagatcctg ttttcagaat ttccccccaa agtagagatg 3960
ctaagccaaa tccagaagaa cccatagatg aagatgaaga tattcaaaca gaaagaataa 4020
gaacagtcac tgctctgacc acttcaatct tagatgagaa acctgttata attgccagct 4080
gtctacacaa agaatatgca ggccagaaga aaagttgctt ttcaaagagg aagaagaaaa 4140
tagcagcaag aaatatctct ttctgtgttc aagaaggtga gattttggga ttgctaggac 4200
ccagtggtgc tggaaaaagt tcatctatta gaatgatatc tgggatcaca aagccaactg 4260
ctggagaggt ggaactgaaa ggctgcagtt cagttttggg ccacctgggg tactgccctc 4320
aagagaacgt gctgtggccc atgctgacgt tgagggaaca cctggaggtg tatgctgccg 4380
tcaaggggct cagggaagcg gacgcgaggc tcgccatcgc aagattagtg agtgctttca 4440
aactgcatga gcagctgaat gttcctgtgc agaaattaac agcaggaatc acgagaaagt 4500
tgtgttttgt gctgagcctc ctgggaaact cacctgtctt gctcctggat gaaccatcta 4560
cgggcataga ccccacaggg cagcagcaaa tgtggcaggc aatccaggca gtcgttaaaa 4620
acacagagag aggtgtcctc ctgaccaccc ataacctggc tgaggcggaa gccttgtgtg 4680
accgtgtggc catcatggtg tctggaaggc ttagatgcat tggctccatc caacacctga 4740
aaaacaaact tggcaaggat tacattctag agctaaaagt gaaggaaacg tctcaagtga 4800
ctttggtcca cactgagatt ctgaagcttt tcccacaggc tgcagggcag gaaaggtatt 4860
cctctttgtt aacctataag ctgcccgtgg cagacgttta ccctctatca cagacctttc 4920
acaaattaga agcagtgaag cataacttta acctggaaga atacagcctt tctcagtgca 4980
cactggagaa ggtattctta gagctttcta aagaacagga agtaggaaat tttgatgaag 5040
aaattgatac aacaatgaga tggaaactcc tccctcattc agatgaacct taaaacctca 5100
aacctagtaa ttttttgttg atctcctata aacttatgtt ttatgtaata attaatagta 5160
tgtttaattt taaagatcat ttaaaattaa catcaggtat attttgtaaa tttagttaac 5220
aaatacataa attttaaaat tattcttcct ctcaaacata ggggtgatag caaacctgtg 5280
ataaaggcaa tacaaaatat tagtaaagtc acccaaagag tcaggcactg ggtattgtgg 5340
aaataaaact atataaactt agaatttttt aaaaatatga cttttttacc ttttacaaaa 5400
cattctcttg ctgaaatatg tgaagggtat attcagtagc caagagttgc atgactactt 5460
cacaccagtt catgatacaa caggtataca ggttttcttt tataaccaac tacaactcaa 5520
gagtcttctg aaagtgttcc agaaattgct ttaaaactca aaagtaaggg gccaggtgca 5580
gtggctcacg cctgtaatcc cagcactttg ggaggccgag gcaggtggat cacaaggtca 5640
ggagttcgag actagcctgg ccaatatggt gaaactccat ctctaataaa aatacaaaaa 5700
ttagccgggc gttggcattt gcctgtagtc ctagctattc gggaggctga gggaggagaa 5760
ttgcttgaac ccgggaggca gaggttgcag tgagccatgt gctagtgcac tccagcctgg 5820

gtgacagagt gagactctgt caaaaaaaaa aaaaacaaaa aaaaaaacaa aaaaccttca 5880 

aggttttgga ggtctttggc cacaatttga gagccccttt tggaaaggtt tcccttttac 5940 

ttttgaataa agggtccgga tttggc 5966 

<210> 77 

<211> 2071 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510082CB1 

<400> 77 

ccgggtagtg agcggaggga caggaagggt agggcaagaa agggagaggg gacaggaggg 60 
aagggtgggc caaagcggtg agaaaggagg gccagccagt tgcgtggggg agagtggccg 120 
aggcccgggg gcaggagtgc agggctctga ggcggggaga ggagaggaga gaagagccgc 180 
ggggggccca gcccggagcc aggatgcccg cgccgcgcgc ccgggagcag ccccgcgtgc 240 
ccggggagcg ccagccgctg ctgcctcgcg gtgcgcgggg ccctcgacgg tggcggcggg 300 
cggcgggcgc ggccgtgctg ctggtggaga tgctggagcg cgccgccttc ttcggcgtca 360 
ccgccaacct cgtgctgtac ctcaacagca ccaacttcaa ctggaccggc gagcaggcga 420 
cgcgcgccgc gctggtattc ctgggcgcct cctacctgct ggcgcccgtg ggcggctggc 480 
tggccgacgt gtacctgggc cgctaccgcg cggtcgcgct cagcctgctg ctctacctgg 540 
ccgcctcggg cctgctgccc gccaccgcct tccccgacgg ccgcagctcc ttctgcggag 600 
agatgcccgc gtcgccgctg ggacctgcct gcccctcggc cggctgcccg cgctcctcgc 660 
ccagccccta ctgcgcgccc gtcctctacg cgggcctgct gctactcggc ctggccgcca 720 
gctccgtccg gagcaacctc acctccttcg gtgccgacca ggtgatggat ctcggccgcg 780 
acgccacccg ccgcttcttc aactggtttt actggagcat caacctgggt gctgtgctgt 840 
cgctgctggt ggtggcgttt attcagcaga acatcagctt cctgctgggc tacagcatcc 900 
ctgtgggctg tgtgggcctg gcatttttca tcttcctctt tgccaccccc gtcttcatca 960 
ccaagccccc gatgggcagc caagtgtcct ctatgcttaa gctcgctctc caaaactgct 1020 
gcccccagct gtggcaacga cactcggcca gagaccgtca atgtgcccgc gtgctggccg 1080 
acgagaggtc tccccagcca ggggcttccc cgcaagagga catcgccaac ttccaggtgc 1140 
tggtgaagat cttgcccgtc atggtgaccc tggtgcccta ctggatggtc tacttccaga 1200 
tgcagtccac ctatgtcctg cagggtcttc acctccacat cccaaacatt ttcccagcca 1260 
acccggccaa catctctgtg gccctgagag cccagggcag cagctacacg atcccggaag 1320 
cctggctcct cctggccaat gttgtggtgg tgctgattct ggtccctctg aaggaccgct 1380 
tgatcgaccc tttactgctg cggtgcaagc tgcttccctc tgctctgcag aagatggcgc 1440 
tggggatgtt ctttggtttt acctccgtca ttgtggcagg agtcctggag atggagcgct 1500 
tacactacat ccaccacaac gagaccgtgt cccagcagat tggggaggtc ctgtacaacg 1560 
cggcaccact gtccatctgg tggcagatcc ctcagtacct gctcattggg atcagtgaga 1620 
tctttgccag catcccaggc ctggagtttg cctactcaga ggccccgcgc tccatgcagg 1680 
gcgccatcat gggcatcttc ttctgcctgt cgggggtggg ctcactgttg ggctccagcc 1740 
tagtggcact gctgtccttg cccgggggct ggctgcactg ccccaaggac tttgggaaca 1800 
tcaacaattg ccggatggac ctctacttct tcctgctggc tggcattcag gccgtcacgg 1860 
ctctcctatt tgtctggatc gctggacgct atgagagggc gtcccagggc ccagcctccc 1920 
acaggccgtt tcagcacgga caggggctcg acaggcccta tccaggcccc ttggtttact 1980 
ctaccggaaa gaacggcagc agtcccagct ctggtttcct tctcggttta ttctgttaga 2040 
atgaaatgtt cccataaata agggcatggt c 2071 

<210> 78 
<211> 3703 
<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 7510367CB1 

<220> 

<221> unsure 

<222> (1) ... (3703) 

<223> a, t, c, g, or other 

<400> 78 

cggacggtgg gcggcgtgac gtagtggctg tgccccgttc ttgccccctc agtactagag 60 
tctccggctt cgctcacgcg ccttgggcat aagagtcctc tcgttggtcc cggaggtggg 120 
gttgcgctca caaggggcga ccgtcgccac ggtggcggcc actgcatcgc gtcccacctc 180 
cgcggccctg ggcgccgtgg tgtcgacggg ccccgagcct atgacgggcc agggccagtc 240 
ggcgtccggg tcgtcggcgt ggagcacggt attccgccac gtccggtatg agaacctgat 300 
agcgggcgtg agcggcggcg tcttatccaa ccttgcgctg catccgctcg acctcgtgaa 360 
gatccgcttc gccggaacaa tcctgtgagt ttatgatatc tgctatacag atgaggataa 420 
cagaagttga ctaacttgcc caagtcacaa gtttgtaaag atggatctgg gatttaaacc 480 
cagtgagtga tggattggaa ctgagaccga aatataatgg aattttacat tgcttgacta 540 
ccatttggaa acttgatgga ctacggggac tttatcaagg agtaacccca aatatatggg 600 
gtgcaggttt atcctgggga ctctactttt tcttttacaa tgccatcaag tcatataaaa 660 
cagaaggaag agctgaacgt ttagaggcaa cagaatacct tgtctcagct gctgaagctg 720 
gagccatgac cctctgcatt acaaacccat tatgggtaac aaaaactcgc cttatgttac 780 
agtatgatgc tgttgttaac tccccacacc gacaatataa aggaatgttt gatacacttg 840 
tgaaaatata taagtatgaa ggtgtgcgtg gattatataa gggatttgtt cctgggctgt 900 
ttggaacatc gcatggtgcc cttcagttta tggcatatga attgctgaag ttgaagtaca 960 
accagcatat caatagatta ccagaagccc agttgagcac agtagaatat atatctgttg 1020 
cagcactatc caaaatattt gctgtcgcag caacataccc atatcaagtc gtaagagctc 1080 
gtcttcagga tcaacacatg ttttacagtg gtgtaataga tgtaatcaca aagacatgga 1140 
ggaaagaagg cgtcggtgga ttttacaagg gaattgctcc taatttgatt agagtgactc 1200 
cagcctgctg tattaccttt gtggtatatg aaaacgtctc acatttttta cttgacctta 1260 
gagaaaagag aaagtaagct caaagaggac aattccagta tatctgccca aggcagcaac 1320 
aagctctttt gtgtttaagg cataaaagaa gaattctgca tagaaacatg gctcatattc 1380 
gaaattgctc tatagtcatt agaagccaga gaactgctaa gtctcctgca atgtttttct 1440 
gctttttgcc ttccccatat atatggaact tggctacctc tgcctgaaat ggctgccatc 1500 
aacacaatgt taaaactgac acgaaggata gagtttcaca gatttctacg ttttattggt 1560 
ggaagctgat ttgcaacatt tgctaaatgg attagatgaa tgtacttctt tttgtgagct 1620 
tacttgcctg gattgcttta aaattaacct ttgtgcaata ccaagaaaat agctctttaa 1680 
aagaatgtct ttgtatgtct caaggtaaat taaggattta ctgaataagg tgttgaccaa 1740 
atccagacca ttttatttta tttttttatt tatttatttt ttgagatgga gtcttgcttt 1800 
gtcgcccagg ctggagtgca gtggcgtgat ctcagctcac tgcaacctcc acctcccggg 1860 
ttcacgccat tctcctgcct cagcctcctg agtagctggg actacaggca cctgccacca 1920 
cgcctggcta actttttttt atattttgag tagaaatggg gtttcaccat gttagccagg 1980 
atggtctcaa tctcctgacc ttgtgatccg cctgccttgg cctcccaaag tgctgggatt 2040 
acaggcgtga gccactgcgc ctggccagac cattttagaa ttgggaaatt ttagtgagaa 2100 
aaaatgcact gtaaatatgc tttagtttta attcagttgg gatgcactac ctagcgaaaa 2160 
ttgagaaact atatacttct cagagaaata tctgacatct attgtcattc cattgctatt 2220 
ttttttcccc agagacttcc ataatttaaa ataaaatcct agatccagtt cttgtttttt 2280 
ggcataaata cttaatctat tttaaattta taaaatctga gcttctagga tccagctgtg 2340 
tcaaccttta tttagcatat ataactataa atcacttatt acagatgcta aatagatcac 2400 
cttttacaga tgctgaaatg tttgggatat gtttgttgac aaggtaaatg gaaatgagaa 2460 
actttatact tcagttttca gatatatgga tctagatccc aaataaatga ttaatcttca 2520 
ttggtttctc aaattcaggt tgaaatacaa attaatagcc tttattgatt ttacttttat 2580 
gagtcattgt agacatctat aaatataaaa gggcctgtac ccaaaggatg ccggaatact 2640 
agtattttta tttatcgtaa acatccacga gtgctgttgc actaccatct atttgttgta 2700 
aataaaagtg ttgttttcaa agccatcttt aaatagttct ttaaaaatag gtcttttttt 2760 
tatattttgg aaaaggcatt gtttttaaag taaagataaa atggtaagta cctaattgta 2820

tttactgtaa tataacatgc agatgaatgc tttataagtt aaatatgatg tattttttca 2880 

tacttctgga ttatactata attcatatga aatcttgata ttagtcccca cacggaaaaa 2940 

gtgaactgca gttgatattt ggtgtttaag atagcaccat tgtttaaata ccgcctatgt 3000 

actcccaaat gaataaaaca taattcttgt cctctgagag catacaagct tgtgtgatag 3060 

ataaatatgc attaaataat tacagctata gattaagaac tcgtaaggaa tatctcacaa 3120 

agcctggtaa gagttctgac agaagagaac tcaatttcag tcattcaaca acaaatagtc 3180 

actcaactcc tgctctttgg caggtacgtc accaagcact gaggatatag caataaataa 3240 

gccagacaat attcctgccc tcagggaatt ttcatcttgg tgtaagtgac agaaaaggaa 3300 

atatatgcat aatgtaattt caggtattta atgctataaa taaaactaca gcagaacggg 3360 

ggaatagtgt tgctatctta ggtagtatgg ttggggtcag gaaaggtctc tgagatagta 3420 

tttggaacat tttgaaagaa gtgaggaagc gagccacagt gagaggaaac acattctagg 3480 

cagagaggac aggaagtaaa aagggcctga tggaaggact aaagttggat agaggggatg 3540 

ggtgagcaca aaacggagtt ggcaaattgg ccttgccgta tgtgtggaaa gacccaggtg 3600

ggacgccang cccctggttc aaaataccgt tcccattctc gtttgccttg gcggatatta 3660

caccctatgt tatggttccg ccttcaaagg cgttaacacc cct 3703 

<210> 79 

<211> 1171 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510413CB1 

<400> 79 

gagcacggag attcggcacg aggaggaaaa gcagcaaacc agatggagaa ggaaggccaa 60
aaagataagt agaaagctgg accatgagct ccttgagggc agggactgaa tggttactat 120
gtccccaggg cccagcatga ccttctcctg gattcctcat cttccttctg tgacctgtgt 180
ctccatcagt ttctcctccg gcatcttttt cttacaggat tcttacctca ggaatagatg 240
gacatggcct ggcagatgat gcagctgctg cttctggctt tggtgactgc tgcggggagt 300
gcccagccca ggagtgcgcg ggccaggacg gacctgctca atgtctgcat gaacgccaag 360
caccacaaga cacagcccag ccccgaggac gagctgtatg gccaggtggg agctcctcaa 420
gggccctccc caggaagtgt tcctctggat gacctacctg gggcagagga gccagaatat 480
ggaggagatg gctgtggtgg ggagagactt agtcctgtgt cttccccacc cagtgcagtc 540
cctggaagaa gaatgcctgc tgcacggcca gcaccagcca ggagctgcac aaggacacct 600
cccgcctgta caactttaac tgggatcact gtggtaagat ggaacccacc tgcaagcgcc 660
actttatcca ggacagctgt ctctgagtgc tcacccaacc tggggccctg gatccggcag 720
gtcaaccaga gctggcgcaa agagcgcatt ctgaacgtgc ccctgtgcaa agaggactgt 780
gagcgctggt gggaggactg tcgcacctcc tacacctgca aaagcaactg gcacaaaggc 840
tggaattgga cctcagggat taatgagtgt ccggccgggg ccctctgcag cacctttgag 900
tcctacttcc ccactccagc cgccctttgt gaaggcctct ggagccactc cttcaaggtc 960
agcaactata gtcgagggag cggccgctgc atccagatgt ggtttgactc agcccagggc 1020
aaccccaatg aggaggtggc caagttctat gctgcggcca tgaatgctgg ggccccgtct 1080
cgtgggatta ttgattcctg atccaagaag ggtcctctgg ggttcttcca acaacctatt 1140
ctaatagaca aatccacatg aaaaaaaaaa a 1171

<210> 80

<211> 323

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1721303CB1

<400> 80

ccaagactcc aaaatggcgt cagttggtga gtgtccggcc ccagtaccag tgaaggacaa 60
gaaacttctg gaggtcaaac tgggggagct gccaagctgg atcttgatgc gggacttcag 120
tcctagtggc attttcggag cgtttcaaag agagcacgag cggctccgca aataccactg 180
aagaggacac actctgcacc cccccacccc acgaccttgg cccgagcccc tccgtgagga 240
acacaatctc aatcgttgct gaatcctttc atatcctaat aggaattaac ctccaaataa 300
aacatgactg gtacgtgtaa aaa 323

<210> 81

<211> 1221

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7502007CB1

<400> 81

attaactcag ctgggagttg aagagccgat gggcagcagg cagacttgag tctcctttct 60
gtccatgggc tcgggccact gtctcaggtc cacccgtggc tccaaaatgg tctcctggtc 120
cgtgatagca aagatccagg aaatactgca gaggaagatg gtgcgagagt tcctggccga 180
gttcatgagc acatatgtca tgatggtatt cggccttggt tccgtggccc atatggttct 240
aaataaaaaa tatgggagct accttggtgt caacttgggt tttggcttcg gagtcaccat 300
gggagtgcac gtggcaggcc gcatctctgg agcccacatg aacgcagctg tgacctttgc 360
taactgtgcg ctgggccgcg tgccctggag gaagtttccg gtctatgtgc tggggcagtt 420
cctgggctcc ttcctggcgg ctgccaccat ctacagtctc ttctacacgg ccattctcca 480
cttttcgggt ggacagctga tggtgaccgg tcccgtcgct acagctggca tttttgccac 540
ctaccttcct gatcacatga cattgtggcg gggcttcctg aatgaggcgt ggctgaccgg 600
gatgctccag ctgtgtctct tcgccatcac ggaccaggag aacaacccag cactgccagg 660
aacagaggcg ctggtgatag gcatcctcgt ggtcatcatc ggggtgtccc ttggcatgaa 720
cacaggatat gccatcaacc cgtcccggga cctgcccccc cgcatcttca ccttcattgc 780
tggttggggc aaacaggtct tcaggtggca tcatctacct ggtcttcatt ggctccacca 840
tcccacggga gcccctgaaa ttggaggatt ctgtggcgta tgaagaccac gggataaccg 900
tattgcccaa gatgggatct catgaaccca cgatctctcc cctcaccccc gtctctgtga 960
gccctgccaa cagatcttca gtccaccctg ccccaccctt acatgaatcc atggccctag 1020
agcacttcta agcagagatt atttgtgatc ccatccattc cccaataaag caaggcttgt 1080
ccgacaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1140
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg ggggggggcg 1200
cgggggcccc gacaaggggg g 1221



<210> 82 

<211> 2008 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506439CB1 



<400> 82 

gcagcctcag aaggtgtgag cagtggccac gagaggcagg ctggctggga catgaggttg 60
gcagagggca ggcaagctgg cccttggtgg gcctcgtcct gagcactcgg aggcactcct 120
atgcttggaa agctcgctat gctgctgtgg gtccagcagg cgctgctcgc cttgctcctc 180
cccacactcc tggcacaggg agaagccagg aggagccgaa acaccaccag gcccgctctg 240
ctgaggctgt cggattacct tttgaccaac tacaggaagg gtgtgcgccc cgtgagggac 300
tggaggaagc caaccaccgt atccattgac gtcattgtct atgccatcct caacgtggat 360
gagaagaatc aggtgctgac cacctacatc tggtaccggc agtactggac tgatgagttt 420
ctccagtgga accctgagga ctttgacaac atcaccaagt tgtccatccc cacggacagc 480 
atctgggtcc cggacattct catcaatgag ttcgtggatg tggggaagtc tccaaatatc 540 
ccgtacgtgt atattcggca tcaacatctc tttgtggcgc ttgccagaaa aggtgaaatc 600 
cgacaggagt gtcttcatga accagggaga gtgggagttg ctgggggtgc tgccctactt 660 
tcgggagttc agcatggaaa gcagtaacta ctatgcagaa atgaagttct atgtggtcat 720 
ccgccggcgg cccctcttct atgtggtcag cctgctactg cccagcatct tcctcatggt 780 
catggacatc gtgggcttct acctgccccc caacagtggc gagagggtct ctttcaagat 840 
tacactcctc ctgggctact cggtcttcct gatcatcgtt tctgacacgc tgccggccac 900 
tgccatcggc actcctctca ttggtgtcta ctttgtggtg tgcatggctc tgctggtgat 960 
aagtttggcc gagaccatct tcattgtgcg gctggtgcac aagcaagacc tgcagcagcc 1020 
cgtgcctgct tggctgcgtc acctggttct ggagagaatc gcctggctac tttgcctgag 1080 
ggagcagtca acttcccaga ggcccccagc cacctcccaa gccaccaaga ctgatgactg 1140 
ctcagccatg ggaaaccact gcagccacat gggaggaccc caggacttcg agaagagccc 1200 
gagggacaga tgtagccctc ccccaccacc tcgggaggcc tcgctggcgg tgtgtgggct 1260 
gctgcaggag ctgtcctcca tccggcaatt cctggaaaag cgggatgaga tccgagaggt 1320 
ggcccgagac tggctgcgcg tgggctccgt gctggacaag ctgctattcc acatttacct 1380 
gctggcggtg ctggcctaca gcatcaccct ggttatgctc tggtccatct ggcagtacgc 1440 
ttgagtgggt acagcccagt ggaggagggg gtacagtcct ggttaggtgg ggacagagga 1500 
tttctgctta ggcccctcag gacccaggga atgccaggga cattttcaag acacagacaa 1560 
agtcccgtgc cctgtttcca atgccaattc atctcagcaa tcacaagcca aggtctgaac 1620 
ccttccacca aaaactgggt gttcaaggcc cttacaccct tgtcccaccc ccagcagctc 1680 
accatggctt taaaacatgc tctcttagat caggagaaac tcgggcactc cctaagtcca 1740 
ctctagttgt ggacttttcc ccattgaccc tcacctgaat aagggacttt ggaattctgc 1800 
ttctctttca caactttgct tttaggttga aggcaaaacc aactctctac tacacaggcc 1860 
tgataactct gtacgaggct tctctaaccc ctagtgtctt ttttttcttc acctcacttg 1920 
tggcagcttc cctgaacact catcccccat cagatgatgg gagtgggaag aataaaatgc 1980 
agtgaaaccc taaaaaaaaa aaaaaggg 2008 

<210> 83 

<211> 1080 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509243CB1 

<400> 83 

ggaggagggc agggcaggcg gcagcctggg cacaggcccc taggtgctta ctcctcacct 60 
gtttcccacc tctcccccat agccagcccc acggccctgg cagggtcctg gccacagcat 120 
gcccagtgct gggctctgca gctgctgggg tggccgggtg ctgcccctgc tgctggccta 180 
tgtctgctac ctgctgctcg gtgccactat cttccagctg ctagagaggc aggcggaggc 240 
tcagtccagg gaccagtttc agttggagaa gctgcgcttc ctggagaact acacctgcct 300 
ggaccagtgg gccatggagc agtttgtgca ggtcatcatg gaagcctggg tgaaaggtgt 360
gaaccccaaa ggcaactcta ccaaccccag caactgggac tttggcagca gtttcttctt 420 
tgcaggcaca gtcgtcacta ccatagggca ctgacagcag ctgctacacc ccctcggcca 480 
gaaaactcac tccatcccac cccaaggaaa ccatcagatc caggatatgg gaacctggca 540 
cccagcacag aggcaggtta ggtcttctgt gtcttctatg ccctgttggg catcccgctt 600 
aacgtgatct tcctcaacca cctgggcaca gggctgcgtg cccatctggc cgccattgaa 660 
agatgggagg accgtcccag gcgctcccag gtactgcaag tcctgggcct ggctctgttc 720 
ctgaccctgg ggacgctggt cattctcatc ttcccaccca tggtcttcag ccatgtggag 780 
ggctggagct tcagcgaggg cttctacttt gctttcatca ctctcagcac cattggcttt 840 
ggggactatg ttgttggcac agaccccagc aagcattata tctcagtgta tcggagcctg 900 
gcagccatct ggatcctcct gggcctggcg tggctggcgc tgatcctccc actgggcccc 960 
ctgcttctgc acagatgctg ccagctctgg ctgctcagta ggggcctcgg cgtcaaggat 1020 
ggggcagcct ctgaccccag tgggctcccc aggcctcaga agatccccat ctctgcatga 1080

<210> 84 

<211> 2412 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7509404CB1 

<400> 84 

ctcgagccgc tcgagccgcg ggcggctggg acgcagctgc ccccgctcgg cgcccgctgc 60 
acccttagca gccactgcca cctgggcccg gagcctccac gatctcgccc ggcgattgtg 120 
ggcaggggcg cctccggatc gatcttctga aattcaagtt ttcaagatga agtttttatt 180 
gacaactgcc tttttaattt taatttcctt gtgggtggaa gaagcctatt ctaaggaaaa 240 
gtcttcaaag aaagggaagg ggaaaaagaa gcagtatcta tgcccatcag agagattgta 300 
ccattgaatg acagcatttt catacttcag tctgtatgta ctaagaaggt cagcagtcag 360 
cagaggacct tgcccgagta cctgccaact ccactagcaa tatcttgaac aggttattgg 420 
tcagttatga tcccaggata agaccaaact tcaaaggcat tcctgttgat gtagtagtca 480 
acatttttat taacagtttt ggatccattc aagaaacaac aatggactat agagttaaca 540 
tcttcctgag acaaaaatgg aatgacccca ggctgaagct ccccagtgat tttaggggtt 600 
cagatgcact gacagtggat ccaacaatgt acaagtgttt atggaaacct gatttatttt 660 
ttgcaaatga aaaaagtgcc aattttcatg atgtgaccca ggaaaacatc ctcctcttta 720 
tttttcgtga tggagatgtc cttgtcagca tgaggttatc tattactctt tcatgccctt 780 
tggacttgac attgtttccc atggatacac aacgttgcaa gatgcaactg gagagctttg 840 
gttacacaac tgatgattta cgatttatct ggcagtcagg agatcctgtg caattagaaa 900 
aaattgcctt gcctcaattt gatatcaaaa aggaagatat tgaatatggt aactgtacaa 960 
aatactataa aggcacgggc tactacacat gcgtggaagt catcttcacc ctgaggaggc 1020 
aggtcggctt ttacatgatg ggggtctacg ccccaaccct gctcattgtt gttctctcct 1080 
ggctttcctt ctggatcaac ccggacgcga gtgctgccag agtgcccctg ggttggtgag 1140 
accagatgca aaaaagtttg tacttctaag tctgatctga gatctaatga cttcagcatt 1200 
gttggaagct taccaagaga ttttgaacta tccaattatg actgctatgg aaaacccatt 1260 
gaagttaaca acggacttgg gaaatctcag gctaagaaca acaagaagcc tccccctgcg 1320 
aaacctgtta ttccaacagc agcaaagcga attgatcttt atgcaagagc attgtttcct 1380 
ttctgcttct tgttcttcaa tgttatatat tggtctatat atttatgata aatcttttcc 1440 
atttgtacaa aataaaattc catttcattg tgacctactc ctttcataaa tgccaatctg 1500 
tgagaacttt tgaattttca tagcaacatt gcattttgga tgccatttga ttgtaataaa 1560 
actgtggcac cttaattttg aatggcagca tgatcatgta atatctgtgc tctaataacg 1620 
atgtatatat gtatagtgaa catattgctt agtaacaaat gaaggacaag catactacat 1680 
aatataatcc atacaattct cttcagttag tgtaaactgc aaatactaca gataattctg 1740 
ataataaaat gatatgcacg ctgaatcctg ctatggtcac cattctaatg tatgtagtat 1800 
ttcaaatttc cttccttgta actttcaaag aaagccatct tattcttgta aaattttaga 1860 
tggtattatc acagatttaa aaaggttgta ttacatattg tttaaacttt gtaagtagaa 1920 
atatatctgt tataattata caggctctgt ggagaaataa agttcaaaaa atattaattt 1980 
gtaaaatcag ctcgttttaa agtgtgcttg tgttgtcaaa aatatcagat agtaatacac 2040 
agtgagcatt tttaaacaaa gggaaaccta tatttatgta actgtatact gaattctgac 2100 
aaaataaaaa aagatacctt attgacgaaa tatttaggat aaacaaaatt ctatttaatc 2160 
caccttaaaa cctaaatgta ttttcatgga tttcatttgt tggtacatat tacacaaaac 2220 
attgtgcctt aaaatgagtc atacatcttt taaattggaa tgcagtaata gatatgtgat 2280 
tttacatcat ttttaagaaa ccaaggggaa gtaataagtt gaaaaagaaa tccataacta 2340 
ttaaaagatt ttaacttttt tattttatta aaatgcttgc atattttaag taaaaaaaaa 2400 
aaaaaaaaaa cc 2412 

<210> 85 
<211> 1004 






<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509439CB1 

<400> 85 

ggctcgggac gcgcgccatt gtgttggtac ccgggaattc ggccattatg gccgggggac 60 
cttcagcagg gctgtggcta ccatgttctc tcgcgcgggt gtcgctgggc tgtcggcctg 120 
gaccttgcag ccgcaatgga ttcaagttcg aaatatggca actttgaaag atatcaccag 180 
gagactaaag tccatcaaaa acatccagaa aattaccaag tctatgaaaa tggtagcggc 240 
agcaaaatat gcccgagctg agagagagct gaaaccagct cgaatatatg gattgggatc 300 
tttagctctg tatgaaaaag ctgatatcaa ggggcctgaa gacaagaaga aacacctcct 360 
tattggtgtg tcctcagatc gaggactgtg tggtgctatt cattcctcca ttgctaaaca 420 
gatgaaaagc gaggttgcta cactaacagc agctgggaaa gaagttatgc ttgttggaat 480 
tggtgacaaa atcagaggca tactttatag ttctttgcag gtgctgaagg aaaggaatga 540 
tgatagtgtg tggaacaact caggaaatca ccaccaccca taccccaaag acctgatcca 600 
tggactcatt ctgaccagtt tctggtggca ttcaaagaag tgggaagaaa gccccccact 660 
tttggagatg cgtcagtcat tgcccttgaa ttactaaatt ctggatatga atttgatgaa 720 
ggctccatca tctttaataa attcaggtct gtcatctcct ataagacaga agaaaagccc 780 
atcttttccc tttaataccg ttgcaagtgc tgacagcatg agtatctatg acgatattga 840 
tgctgacgtg ctgcaaatta ccaagaatac aatctggcca acatcatcta ctactctctg 900 
aaggagtcca ccactagtga gcagagtgcc aggatgacag ccatggacaa tgccagcagg 960 
ctttccacac ctgcgccgta ctatagtccg gtcgtcactt ggtc 1004 

<210> 86 

<211> 5231 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510202CB1 

<400> 86 

atgagcaaga gacgcatgag cgtgggtcag caaacatggg ctcttctctg caagaactgt 60 
ctcaaaaaat ggagaatgaa aagacagacc ttgttggaat ggctcttttc atttcttctg 120 
gtactgtttc tgtacctatt tttctccaat ttacatcaag ttcatgacac tcctcaaatg 180 
tcttcaatgg atctgggacg tgtagatagt tttaatgata ctaattatgt tattgcattt 240 
gcacctgaat ccaaaactac ccaagagata atgaacaaag tggcttcagc cccattccta 300 
aaaggaagaa caatcatggg gtggcctgat gaaaaaagca tggatgaatt ggatttgaac 360 
tattcaatag acgcagtgag agtcatcttt actgatacct tctcctacca tttgaagttt 420 
tcttggggac atagaatccc catgatgaaa gagcacagag accattcagc tcactgtcaa 480 
gcagtgaatg aaaaaatgaa gtgtgaaggt tcagagttct gggagaaagg ctttgtagct 540 
tttcaagctg ccattaatgc tgctatcata gaaatcgcaa caaatcattc agtgatggaa 600 
cagctgatgt cagttactgg tgtacatatg aagatattac cttttgttgc ccaaggagga 660 
gttgcaactg attttttcat tttcttttgc attatttctt tttctacatt tatatactat 720 
gtatcagtca atgttacaca agaaagacaa tacattacgt cattgatgac aatgatggga 780 
ctccgagagt cagcattctg gctttcctgg ggtttgatgt atgctggctt catccttatc 840 
atggccactt taatggctct tattgtaaaa tctgcacaaa ttgtcgtcct gactggtttt 900 
gtgatggtct tcaccctctt tctcctctat ggcctgtctt tgataacttt agctttcctg 960 
atgagtgtgt tgataaagaa acctttcctt acgggcttgg ttgtgtttct ccttattgtc 1020 
ttttggggga tcctgggatt cccagcattg tatacacatc ttcctgcatt tttggaatgg 1080 
actttgtgtc ttcttagccc ctttgccttc actgttggga tggcccagct tatacatttg 1140 
gactatgatg tgaattctaa tgcccacttg gattcttcac aaaatccata cctcataata 1200 
gctactcttt tcatgttggt ttttgacacc cttctgtatt tggtattgac attatatttt 1260
gacaaaattt tgcccgctga atatggacat cgatgttctc ccttgttttt cctgaaatcc 1320 
tgtttttggt ttcaacacgg aagggctaat catgtggtcc ttgagaatga aacagattct 1380 
gatcctacac ctaatgactg ttttgaacca gtgtctccag aattctgtgg gaaggaagcc 1440 
atcagaatca aaaatcttaa aaaagaatat gcagggaagt gtgagagagt agaagctttg 1500 
aaaggtgtgg tgtttgacat atatgaaggc cagatcactg ccctccttgg tcacagtgga 1560
gctggaaaaa ctaccctgtt aaacatactt agtgggttgt cagttccaac atcaggttca 1620 
gtcactgtct ataatcacac actttcaaga atggctgata tagaaaatat cagcaagttc 1680 
actggatttt gtccacaatc caatgtgcaa tttggatttc tcactgtgaa agaaaacctc 1740 
aggctgtttg ctaaaataaa agggattttg ccacatgaag tggagaaaga ggttttgcta 1800 
ttggatgaac cgactgctgg attggatcct ctttcaaggc accgaatatg gaatctcctg 1860 
aaagagggga aatcagacag agtaattctc ttcagcaccc agtttataga tgaggctgac 1920 
attctggcgg acaggaaggt gttcatatcc aatgggaagc tgaagtgtgc aggctcttct 1980 
ctgttcctta agaagaaatg gggcataggc taccatttaa gtttgcatct gaatgaaagg 2040 
tgtgatccag agagtataac atcactggtt aagcagcaca tctctgatgc caaattgaca 2100 
gcacaaagtg aagaaaaact tgtatatatt ttgcctttgg aaaggacaaa caaatttcca 2160 
gaactttaca gggatcttga tagatgttct aaccaaggca ttgaggatta tggtgtttcc 2220 
ataacaactt tgaatgaggt gtttctgaaa ttagaaggaa aatcaactat tgatgaatca 2280 
gatattggaa tttggggaca attacaaact gatggggcaa aagatatagg aagccttgtt 2340
gagctggaac aagttttgtc ttccttccac gaaacaagga aaacaatcag tggcgtggcg 2400 
ctctggaggc agcaggtctg tgcaatagca aaagttcgct tcctaaagtt aaagaaagaa 2460 
agaaaaagcc tgtggactat attattgctt tttggtatta gctttatccc tcaacttttg 2520 
gaacatctat tctacgagtc atatcagaaa agttacccgt gggaactgtc tccaaataca 2580 
tacttcctct caccaggaca acaaccacag gatcctctga cccatttact ggtcatcaat 2640 
aagacagggt caaccattga taacttttta cattcactga ggcgacagaa catagctata 2700 
gaagtggatg cctttggaac tagaaatggc acagatgacc catcttacaa tggtgctatc 2760 
attgtgtcag gtgatgaaaa ggatcacaga ttttcaatag catgtaatac aaaacggctg 2820 
aattgctttc ctgtcctcct ggatgtcatt agcaatggac tacttggaat ttttaattcg 2880 
tcagaacaca ttcagactga cagaagcaca ttttttgaag agcatatgga ttatgagtat 2940 
gggtaccgaa gtaacacctt cttctggata ccgatggcag cctctttcac tccatacatt 3000
gcaatgagca gcattggtga ctacaaaaaa aaagctcatt cccagctacg gatttcaggc 3060 
ctctaccctt ctgcatactg gtttggccaa gcactggtgg atgtttccct gtactttttg 3120 
atcctcctgc taatgcaaat aatggattat atttttagcc cagaggagat tatatttata 3180 
attcaaaacc tgttaattca aatcctgtgt agtattggct atgtctcatc tcctgttttc 3240 
ttgacatatg tgatttcatt catttttcgc aatgggagaa aaaatagtgg catttggtca 3300 
tttttcttct taattgtggt catcttctcg atagttgcta ctgatctaaa tgaatatgga 3360 
tttctagggc tattttttgg caccatgtta atacctccct tcacattgat tggctctcta 3420 
ttcatttttt ctgagatttc tcctgattcc atggattact taggagcttc agaatctgaa 3480 
attgtatacc tggcactgct aataccttac cttcattttc tcatttttct tttcattctg 3540 
cgatgcctag aaatgaactg caggaagaaa ctaatgagaa aggatcctgt gttcagaatt 3600 
tctccaagaa gcaacgctat ttttccaaac ccagaagagc ctgaaggaga ggaggaagat 3660
atccagatgg aaagaatgag aacagtgaat gctatggctg tgcgagactt tgatgagaca 3720 
cccgtcatca ttgccagctg tctacggaag gaatatgcag gcaaaaagaa aaattgcttt 3780 
tctaaaagga agaaaacaat tgccacaaga aatgtctctt tttgtgttaa aaaaggtgaa 3840 
gttataggac tgttaggaca caatggagct ggtaaaagta caactattaa gatgataact 3900 
ggagacacaa aaccaactgc aggacaggtg attttgaaag ggagcggtgg aggggaaccc 3960 
ctgggcttcc tggggtactg ccctcaggag aatgcgctgt ggcccaacct gacagtgagg 4020 
cagcacctgg aggtgtacgc tgccgtgaaa ggtctcagga aaggggacgc aatgatcgcc 4080 
atcacacggt tagtggatgc gctcaagctg caggaccagc tgaaggctcc cgtgaagacc 4140 
ttgtcagagg gaataaagcg aaaggtacgg gcagggcttg ttgttgctct gcaagtgccg 4200 
tagatgggaa ccagaggggc tgttcctccc gttgcagctg tgctttgtgc tgagcatcct 4260 
ggggaacccg tcagtggtgc ttctggatga gccgtcgacc gggatggacc ccgaggggca 4320 
gcagcaaatg tggcaggtga ttcgggccac ctttagaaac acggagaggg gcgccctcct 4380 
gaccacccac tacatggcag aggctgaggc ggtgtgtgac cgagtggcca tcatggtgtc 4440 
aggaaggctg agatgtattg gttccatcca acacctgaaa agcaaatttg gcaaagacta 4500 
cctgctggag atgaagctga agaacctggc acaaatggag cccctccatg cagagatcct 4560 






gaggcttttc ccccaggctg ctcagcagga aaggttctcc tccctgatgg tctataagtt 4620
gcctgttgag gatgtgcgac ctttatcaca ggctttcttc aaattagaga tagttaaaca 4680
gagtttcgac ctggaggagt acagcctctc acagtctacc ctggagcagg ttttcctgga 4740
gctctccaag gagcaggagc tgggtgatct tgaagaggac tttgatccct cggtgaagtg 4800
gaaactcctc ctgcaggaag agccttaaag ctccaaatac cctatatctt tctttaatcc 4860
tgtgactctt ttaaagataa tattttatag ccttaatatg ccttatatca gaggtggtac 4920
aaaatgcatt tgaaactcat gcaataatta tcctcagtag tatttcttac agtgagacaa 4980
caggcgatgt cagtgagggc gatcataggg cataagccta agccatacca tgcagccttt 5040
gtgccagcaa ccaaatccca tgtttcctac tgtgttaagt ttaaaaatgc atttattata 5100
gaattgtcta catttctgag gatgtcatgg agaatgctta attttcttcc tctgaacttc 5160
aaaatattaa atattttctt atttattttt ttgattaaag tataaattaa gacaccctat 5220
tgacttccgg g 5231



<210> 87 

<211> 3269 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510203CB1 



<400> 87 

ggctttgtga ctgggtgctc ttgggaactt aaaggcccag gtctctgaga acattctcat 60 
atcttcagag aagagaaagg agtcaggttt agaatggtct ttttgtaaaa acaaaaattt 120 
ttaaactctg aaagattttt atgcagaggg atgtgaccac ctgcaagttt actgggagag 180 
aaataggagg aaatgaagct ccagccagat attaggaatt gagcgaagca tgagaaaaag 240 
ctctgcagta atgagaagta accctggaca gagtggcagg catagctctg gcggtcctct 300 
tggggatgga cattcgtccc tctctgctgg tcggaatcaa ggtttacaag gatggacttg 360 
agtcctgctc gaggtctagg ggtatccaga gcagggtggg ttagagaggg aggcctaaga 420 
gtcctgctcg aggtctaggg gtatcctgct agggtgggtt agacggggag gcctgagagg 480 
atgagaggtg ggatctgcac acgcactgag aaaggcatag tatagactca gaacagtcct 540 
ctcccaatct ccccttctac cctccaggac ctccctctga gattctgcca ctggtacaag 600 
ctgtccccaa agcctgggct gagaatggac aagagtctga ctcagccaca gcccagtcag 660 
tacagaaccc agagaaaaca aaggaggggc tggaggagga gcagagcaca tctggtcgcc 720 
tgctgcagga agaaagcaag aaggagggcg ccgtggcctt gcacgtgtac caagcttact 780 
ggaaggccgt gggccagggc ttggccttag ccatcctctt ctctctgctt ctcatgcaag 840 
ccacgcggaa cgctgctgac tggtggctct cccactggat ctctcagctg aaggctgaga 900 
atagctccca ggaggcgcaa ccctccacca gcccagcttc tatggggctc ttctctccgc 960
agctgctcct cttttcccct ggaaacctct acatcccagt gttcccactg cccaaagctg 1020 
cccccaatgg ctcctcagac atccgtttct acctcaccgt gtatgcgacc attgctggtg 1080 
taaattccct ctgcaccctt ctccgggcag tgctctttgc agcaggcacc cttcaagcag 1140 
ctgccactct gcatcgccgc ctgctgcatc gagtccttat ggcaccagtg actttcttca 1200 
atgccacacc cacgggccgg atcctaaacc gcttctcctc tgatgtggcc tgtgcggatg 1260 
acagcctgcc cttcatcctc aacatcctcc tggccaacgc ggcaggcctg ctggggctcc 1320 
tggccgtgct gggctctggc ctgccctggc tgctgctcct gctgccgcct ttgagcatca 1380 
tgtactatca cgtgcagcgc cactacaggg cctcctcacg ggagctgcgg cgcctgggca 1440 
gcctcaccct gtctccactg tatagccatc tggccgatac cttggctggc ctctctgtgc 1500 
tccgggccac aggggccacc tacaggtttg aggaggagaa cctgcgactc cttgagctaa 1560 
accagaggtg ccagtttgcc accagtgcca caatgcagtg gctggacatt cggctacagc 1620 
tcatgggggc ggcagtggtc agcgctatcg caggcatcgc tctggtgcag caccagcagg 1680 
gcctcgctaa cccagggctg gtgggcttgt cgctgtctta tgccctgtcc ctgacgggcc 1740 
tgctctcggg cctggtgagc agcttcacac agacagaggc catgctggtg agcgtcgagc 1800 
ggctggaaga gtacacctgt gacctgcccc aggaacccca gggccagcca ctgcaggtgg 1860 
gcctgtaccc ccaccccagg ccaaagctct ggaaccctga aggccccagt ctccctcaca 1920 
attcctttct ttttgcccac ccatctttct cagctcccat aacctctctt catgatgacc 1980 
acaattcttc accatgtccc ttcttcccca tctctcattc tctcattcct ctcacactgt 2040
ccatttctca ttattctccc ctcctcacca tcgctcctca tctcccctat ctccctttcc 2100
ccgtctgtct cccacccatg gaccccacca gctgggcacc ggctggctga cccagggggg 2160
cgtggagttc caggacgtgg tgttggcgta ccggccaggg ctgccgaatg ccctggatgg 2220
agtgaccttc tgcgtgcagc ctggagagaa gttgggcatc gtgggccgca caggctccgg 2280
caagtcttcc ctgttgttgg tgctcttccg gctgctagag cccagttcag ggcgagtgct 2340
gctggacggc gtggacacca gccagctgga gctggcccag ctcagatccc agttggctat 2400
catcccccag gagccctttt tgttcagtgg gactgttcgg gaaaacctgg acccccaggg 2460
cctacataag gacagggcct tgtggcaggc cctgaagcag tgccacctga gtgaggtgat 2520
tacatccatg ggtggtctgg atggtgagct gggtgagggg ggccggagct tatctcttgg 2580
gcagaggcag ctgttgtgtt tggccagggc tctcctcaca gatgccaaga tcctgtgtat 2640
cgatgaggcc acagcaagtg tggaccagaa gacagaccag ctgctccagc agaccatctg 2700
caaacgcttt gccaacaaga cagtgctgac cattgcccat aggctcaaca cgatcctgaa 2760
ctcagaccgg gtgctggtgc tacaagcggg gagagtggta gagctggact ccccggccac 2820
cctgcgcaac cagccccact ccctgttcca gcagctgctg cagagcagcc agcagggagt 2880
ccctgcctca ctcggaggtc cctgagccca atcccacacc ctgcagagtt ctcccctctc 2940
tctgatccag gccgggccta tacagaggtg ctggctgctt gtttacattc tcctctgggg 3000
ctctacctct ccacacttcc ccagaaggga aaagggcacc ctggattact ctttggaaat 3060
cactccttgg tgggcagcat cctgaggctt ccccagaacc aggcctctgc tctggccctc 3120
ttgcatctgg aacgccaggt gggtttttct ggcataggag cccacttgca ttttcatagt 3180
tttatttgat aaaattccat cttacattct gtgtattaaa aaaataatat ttctggtgtg 3240
agaaaaaaaa aaaaaaaaaa atggttcgg 3269



<210> 88 

<211> 7706 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510208CB1 

<400> 88 

gcgccccgcc cccgcgcggg cgatgcccag cggcgcggcg ggctgcgggg cccggcgggg 60
cgcgcagagg agcgggccgc ggcgctgagg cggcggagcg tggccccgcc atgggcttcc 120
tgcaccagct gcagctgctg ctctggaaga acgtgacgct caaacgccgg agcccgtggg 180
tcctggcctt cgagatcttc atccccctgg tgctgttctt tatcctgctg gggctgcgac 240
agaagaagcc caccatctcc gtgaaggaag tctccttcta cacagcggcg cccctgacgt 300
ctgccggcat cctgcctgtc atgcaatcgc tgtgcccgga cggccagcga gacgagttcg 360
gcttcctgca gtacgccaac tccacggtca cgcagctgct tgagcgcctg gaccgcgtgg 420
tggaggaagg caacctgttt gacccagcgc ggcccagcct gggctcagag ctcgaggccc 480
tacgccagca tctggaggcc ctcagtgcgg gcccgggcac ctcggggagc cacctggaca 540
gatccacagt gtcttccttc tctctggact cggtggccag aaacccgcag gagctctggc 600
gtttcctgac gcaaaacttg tcgctgccca atagcacggc ccaagcactc ttggccgccc 660
gtgtggaccc gcccgaggtc taccacctgc tctttggtcc ctcatctgcc ctggattcac 720
agtctggcct ccacaagggt caggagccct ggagccgcct agggggcaat cccctgttcc 780
ggatggagga gctgctgctg gctcctgccc tcctggagca gctcacctgc acgccgggct 840
cgggggagct gggccggatc ctcactgtgc ctgagagtca gaagggagcc ctgcagggct 900
accgggatgc tgtctgcagt gggcaggctg ctgcgcgtgc caggcgcttc tctgggctgt 960
ctgctgagct ccggaaccag ctggacgtgg ccaaggtctc ccagcagctg ggcctggatg 1020
cccccaacgg ctcggactcc tcgccacagg cgccaccccc acggaggctg caggcgcttc 1080
tgggggacct gctggatgcc cagaaggttc tgcaggatgt ggatgtcctg tcggccctgg 1140
ccctgctact gccccagggt gcctgcactg gccggacccc cggaccccca gccagtggtg 1200
cgggtggggc ggccaatggc actggggcag gggcagtcat gggccccaac gccaccgctg 1260
aggagggcgc accctctgct gcagcactgg ccaccccgga cacgctgcag ggccagtgct 1320
cagccttcgt acagctctgg gccggcctgc agcccatctt gtgtggcaac aaccgcacca 1380
ttgaacccga ggcgctgcgg cggggcaaca tgagctccct gggcttcacg agcaaggagc 1440
agcggaacct gggcctcctc gtgcacctca tgaccagcaa ccccaaaatc ctgtacgcgc 1500
ctgcgggctc tgaggtcgac cgcgtcatcc tcaaggccaa cgagactttt gcttttgtgg 1560
gcaacgtgac tcactatgcc caggtctggc tcaacatctc ggcggagatc cgcagcttcc 1620
tggagcaggg caggctgcag caacacctgc gctggctgca gcagtatgta gcagagctgc 1680
ggctgcaccc cgaggcactg aacctgtcac tggatgagct gccgccggcc ctgagacagg 1740
acaacttctc gctgcccagt ggcatggccc tcctgcagca gctggatacc attgacaacg 1800
cggcctgcgg ctggatccag ttcatgtcca aggtgagcgt ggacatcttc aagggcttcc 1860
ccgacgagga gagcattgtc aactacaccc tcaaccaggc ctaccaggac aacgtcactg 1920
tttttgccag tgtgatcttc cagacccgga aggacggctc gctcccgcct cacgtgcact 1980
acaagatccg ccagaactcc agcttcaccg agaaaaccaa cgagatccgc cgcgcctact 2040
ggcggcctgg gcccaatact ggcggccgct tctacttcct ctacggcttc gtctggatcc 2100
aggacatgat ggagcgcgcc atcatcgaca cttttgtggg gcacgacgtg gtggagccag 2160
gcagctacgt gcagatgttc ccctacccct gctacacacg cgatgacttc ctgtttgtca 2220
ttgagcacat gatgccgctg tgcatggtga tctcctgggt ctactccgtg gccatgacca 2280
tccagcacat cgtggcggag aaggagcacc ggctcaagga ggtgcgcggg ccgggactct 2340
ccctggaggc gcgggctggc agggaggggc ggcgtcctcc ccggggcctg ccccaagccc 2400
ctggcccacc cgcaggtgat gaagaccatg ggcctgaaca acgcggtgca ctgggtggcc 2460
tggttcatca ccggctttgt gcagctgtcc atctccgtga cagcactcac cgccatcctg 2520
aagtacggcc aggtgcttat gcacagccac gtggtcatca tctggctctt cctggcagtc 2580
tacgcggtgg ccaccatcat gttctgcttc ctggtgtctg tgctgtactc caaggccaag 2640
ctggcctcgg cctgcggtgg catcatctac ttcctgagct acgtgcccta catgtacgtg 2700
gcgatccgag aggaggtggc gcatgataag atcacggcct tcgagaagtg catcgcgtcc 2760
ctcatgtcca cgacggcctt tggtctgggc tctaagtact tcgcgctgta tgaggtggcc 2820
ggcgtgggca tccagtggca caccttcagc cagtccccgg tggaggggga cgacttcaac 2880
ttgctcctgg ctgtcaccat gctgatggtg gacgccgtgg tctatggcat cctcacgtgg 2940
tacattgagg ctgtgcaccc aggcatgtac gggctgcccc ggccctggta cttcccactg 3000
cagaagtcct actggctggg cagtgggcgg acagaagcct gggagtggag ctggccgtgg 3060
gcacgcaccc cccgcctcag tgtcatggag gaggaccagg cctgtgccat ggagagccgg 3120
cgctttgagg agacccgtgg catggaggag gagcccaccc acctgcctct ggttgtctgc 3180
gtggacaaac tcaccaaggt ctacaaggac gacaagaagc tggccctgaa caagctgagc 3240
ctgaacctct acgagaacca ggtggtctcc ttcttgggcc acaacggggc gggcaagacc 3300
accaccatgt ccatcctgac cggcctgttc cctccaacgt cgggttccgc caccatctac 3360
gggcacgaca tccgcacgga gatggatgag atccgcaaga acctgggcat gtgcccgcag 3420
cacaacgtgc tctttgaccg gctcacggtg gaggaacacc tctggttcta ctcacggctc 3480
aagagcatgg ctcaggagga gatccgcaga gagatggaca agatgatcga ggacctggag 3540
ctctccaaca aacggcactc actggtgcag acattgtcgg gtggcatgaa gcgcaagctg 3600
tccgtggcca tcgccttcgt gggcggctct cgcgccatca tcctggacga gcccacggcg 3660
ggcgtggacc cctacgcgcg ccgcgccatc tgggacctca tcctgaagta caagccaggc 3720
cgcaccatcc ttctgtccac ccaccacatg gatgaggctg acctgcttgg ggaccgcatt 3780
gccatcatcg tcccatggga agctcaagtg ctgcggctcc ccgctcttcc tcaagggcac 3840
ctatggcgac gggtaccgcc tcacgctggt caagcggccc gccgagccgg ggggccccca 3900
agagccaggg ctggcatcca gccccccagg tcgggccccg ctgagcagct gctccgagct 3960
ccaggtgtcc cagttcatcc gcaagcatgt ggcctcctgc ctgctggtct cagacacaag 4020
cacggagctc tcctacatcc tgcccagcga ggccgccaag aagggggctt tcgagcgcct 4080
cttccagcac ctggagcgca gcctggatgc actgcacctc agcagcttcg ggctgatgga 4140
cacgaccctg gaggaagtgt tcctcaaggt gtcggaggag gatcagtcgc tggagaacag 4200
tgaggccgat gtgaaggagt ccaggaagga tgtgctccct ggggcggagg gcccggcgtc 4260
tggggagggt cacgctggca atctggcccg gtgctcggag ctgacccagt cgcaggcatc 4320
gctgcagtcg gcgtcatctg tgggctctgc ccgtggcgac gagggagctg gctacaccga 4380
cgtctatggc gactaccgcc ccctctttga taacccacag gacccagaca atgtcagcct 4440
gcaagaggtg gaggcagagg ccctgtcgag ggtcggccag ggcagccgca agctggacgg 4500
cgggtggctg aaggtgcgcc agttccacgg gctgctggtc aaacgcttcc actgcgcccg 4560
ccgcaactcc aaggcactct tctcccagat cttgctgcca gccttcttcg tctgcgtggc 4620
catgaccgtg gccctgtccg tcccggagat tggtgatctg cccccgctgg tcctgtcacc 4680
ttcccagtac cacaactaca cccagccccg tggcaatttc atcccctacg ccaacgagga 4740
gcgccgcgag taccggctgc ggctatcgcc cgacgccagc ccccagcagc tcgtgagcac 4800
gttccggctg ccgtcggggg tgggtgccac ctgcgtgctc aagtctcccg ccaacggctc 4860 
gctggggccc acgttgaacc tgagcagcgg ggagtcgcgc ctgctggcgg ctcggttctt 4920 
cgacagcatg tgtctggagt ccttcacaca ggggctgcca ctgtccaatt tcgtgccacc 4980 
cccaccctcg cccgccccat ctgactcgcc agcgtccccg gatgaggacc tgcaggcctg 5040 
gaacgtctcc ctgccgccca ccgctgggcc agaaatgtgg acgtcggcac cctccctgcc 5100 
gcgcctggta cgggagcccg tccgctgcac ctgctctgcg cagggcaccg gcttctcctg 5160 
ccccagcagt gtgggcgggc acccgcccca gatgcgggtg gtcacaggcg acatcctgac 5220 
cgacatcacc ggccacaatg tctctgagta cctgctcttc acctccgacc gcttccgact 5280 
gcaccggtat ggggccatca cctttggaaa cgtcctgaag tccatcccag cctcatttgg 5340 
caccagggcc ccacccatgg tgcggaagat cgcggtgcgc agggctgccc aggttttcta 5400 
caacaacaag ggctatcaca gcatgcccac ctacctcaac agcctcaaca acgccatcct 5460 
gcgtgccaac ctgcccaaga gcaagggcaa cccggcggct tacggcatca ccgtcaccaa 5520 
ccaccccatg aataagacca gcgccagcct ctccctggat tacctgctgc agggcacgga 5580 
tgtcgtcatc gccatcttca tcatcgtggc catgtccttc gtgccggcca gcttcgttgt 5640 
cttcctcgtg gccgagaagt ccaccaaggc caagcatctg cagtttgtca gcggctgcaa 5700 
ccccatcatc tactggctgg cgaactacgt gtgggacatg ctcaactacc tggtccccgc 5760 
tacctgctgt gtcatcatcc tgtttgtgtt cgacctgccg gcctacacgt cgcccaccaa 5820 
cttccctgcc gtcctctccc tcttcctgct ctatgggtgg tccatcacgc ccatcatgta 5880 
cccggcctcc ttctggttcg aggtccccag ctccgcctac gtgttcctca ttgtcatcaa 5940 
tctcttcatc ggcatcaccg ccaccgtggc caccttcctg ctacagctct tcgagcacga 6000 
caaggacctg aaggttgtca acagttacct gaaaagctgc ttcctcattt tccccaacta 6060 
caacctgggc cacgggctca tggagatggc ctacaacgag tacatcaacg agtactacgc 6120 
caagattggc cagtttgaca agatgaagtc cccgttcgag tgggacattg tcacccgcgg 6180 
actggtggcc atggcggttg agggcgtcgt gggcttcctc ctgaccatca tgtgccagta 6240 
caacttcctg cggcggccac agcgcatgcc tgtgtctacc aagcctgtgg aggatgatgt 6300 
ggacgtggcc agtgagcggc agcgagtgct ccggggagac gccgacaatg acatggtcaa 6360 
gattgagaac ctgaccaagg tctacaagtc ccggaagatt ggccgtatcc tggccgttga 6420 
ccgcctgtgc ctgggtgtgc gtcctggcga gtgcttcggg ctcctgggcg tcaacggtgc 6480 
gggcaagacc agcaccttca agatgctgac cggcgacgag agcacgacgg ggggcgaggc 6540 
cttcgtcaat ggacacagcg tgctgaagga gctgctccag gtgcagcaga gcctcggcta 6600 
ctgcccgcag tgtgacgcgc tgttcgacga gctcacggcc cgggagcacc tgcagctgta 6660 
cacgcggctg cgtgggatct cctggaagga cgaggcccgg gtggtgaagt gggctctgga 6720 
gaagctggag ctgaccaagt acgcagacaa gccggctggc acctacagcg gcggcaacaa 6780 
gcggaagctc tccacggcca tcgccctcat tgggtaccca gccttcatct tcctggacga 6840 
gcccaccaca ggcatggacc ccaaggcccg gcgcttcctc tggaacctca tcctcgacct 6900 
catcaagaca gggcgttcag tggtgctgac atcacacagc atggaggagt gcgaggcgct 6960 
gtgcacgcgg ctggccatca tggtgaacgg tcgcctgcgg tgcctgggca gcatccagca 7020
cctgaagaac cggtttggag atggctacat gatcacggtg cggaccaaga gcagccagag 7080 
tgtgaaggac gtggtgcggt tcttcaaccg caacttcccg gaagccatgc tcaaggagcg 7140 
gcaccacaca aaggtgcagt accagctcaa gtcggagcac atctcgctgg cccaggtgtt 7200 
cagcaagatg gagcaggtgt ctggcgtgct gggcatcgag gactactcgg tcagccagac 7260 
cacactggac aatgtgttcg tgaactttgc caagaagcag agtgacaacc tggagcagca 7320 
ggagacggag ccgccatccg cactgcagtc cccctctcgg ctgcttgctc agcctgctcc 7380 
ggccccggtc tgcccccacg gagctccggg cacttgtggc agacgagccc gaggacctgg 7440 
acacggagga cgagggcctc atcagcttcg aggaggagcg ggcccagctg tccttcaaca 7500 
cggacacgct ctgctgacca cccagagctg ggccagggag gacacgctcc actgaccacc 7560 
cagagctggg ccagggactc aacaatgggg acagaagtcc cccagtgcct gccagggcct 7620 
ggagtggagg ttcaggacca aggggcttct ggtcctccag cccctgtact cggccatgcc 7680 
ctgcggtcac tgcggttgcc ggccct 7706



<210> 89 

<211> 3159 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 7510446CB1 

<400> 89 

agcaagagca gaggcttaag gagctacact gggggaagga caggggcaag caggccaagg 60
cctggccggg gctcgggggg agggaatatg gagcaatccc ggtcacagca gcgtgggggt 120
gaacaaagct ggtggggtag tgacccccag taccagtata tgccctttga acactgcacc 180
agctacggac tgccctctga gaatgggggc ctccagcaca ggctccggaa ggatgcaggc 240
ccccgccaca acgtccaccc cacacagatt tatggccatc acaaagaaca attctcagac 300
agggagcagg acatagggat gcccaagaag acaggctcca gttctaccgt ggacagcaag 360
gatgaggatc actattctaa atgtcaaggt gatggggact gaggaataaa gaaatctgga 420
gtagaaacag attgtatcca ccgcctggga caggtggtga gaagaaaatt aggggaagac 480
tggatctttc tggtgcttct gggactgctg atggctctgg tcagctggag catggactac 540
gtcagtgcca aaagccttca ggcctacaag tggtcctacg cgcagatgca gcccagcctt 600
cctctgcagt tcctggtctg ggtcaccttc ccactagtcc tcatcctctt cagcgccctc 660
ttctgccacc tcatctctcc ccaggctgtt ggctctggaa tccccgaaat gaagacaata 720
cttcgtgggg ttgtcctgaa ggaatacctc acaatgaaag cctttgtggc caaggttgtc 780
gccctgactg cgggcctggg cagtggcatc cccgtgggga aagagggccc cttcgtccac 840
attgccagca tctgtgctgc tgtcctcagc aaattcatgt ctgtgttctg cggggtatat 900
gagcagccat actactactc tgatatcctg acggtgggct gtgctgtggg agtcggctgt 960
tgttttggga caccacttgg aggagtgcta tttagcatcg aggtcacctc cacctacttt 1020
gctgttcgga actactggag aggattcttt gcagccacgt tcagcgcctt tgtgtttcga 1080
gtgctggcag tgtggaacaa ggatgctgtc accatcactg ctctgttcag aaccaatttc 1140
cgaatggatt tcccctttga cctgaaggaa ctaccagctt ttgctgccat cgggatttgc 1200
tgtgggctcc tgggagctgt atttgtgtat ctgcatcgcc aagtcatgct cggtgtccga 1260
aagcacaagg ccctcagcca gtttcttgct aagcaccgcc tgctgtatcc tggaattgtt 1320
acctttgtca ttgcctcatt caccttccca ccaggaatgg gtcaattcat ggctggagag 1380
ttgatgcccc gcgaagccat cagtactttg tttgacaaca atacatgggt gaaacacgcg 1440
ggtgatcctg agagcctggg ccagtcagct gtgtggattc acccccgggt caacgttgtc 1500
atcatcatct ttctcttctt cgtcatgaag ttctggatgt ccatcgtggc caccactatg 1560
cccataccct gcggaggctt catgcctgtg tttgtgctag gagctgcatt tggaaggctg 1620
gtaggagaaa tcatggccat gctctttcct gatggtattt tgtttgatga catcatctac 1680
aagatcctac ctgggggcta tgcagtaatt ggagcagcag cgctgactgg tgccgtttcc 1740
cacacagtct ccacagctgt gatttgcttc gaattaacgg gtcagattgc tcacatcctg 1800
cccatgatgg tggctgttat cttggccaac atggtggccc agagcctgca gccctctctc 1860
tatgacagca tcatccaggt caagaagcta ccctacttgc ctgaccttgg ctggaaccag 1920
ctcagcaaat ataccatctt tgttgaggac atcatggtac gtgatgtgaa gtttgtttca 1980
gcttcttaca catatgggga gttgcgaacc ctgctccaga ccaccacagt caagacttta 2040
ccactggttg actcaaaaga ttcaatgatc ctgctgggct cggtggagcg gtcggaactg 2100
caggccctcc tgcagcgcca cctgtgtcct gagcgcagac tgcgcgcagc ccaagagatg 2160
gcgcggaagt tgtcggagct gccttacgac gggaaggcgc ggctggctgg ggaggggctc 2220
cccggcgcgc ctccaggccg gcccgagtcc ttcgcctttg tggatgagga tgaggacgaa 2280
gacctctctg gcaagagcga gcttcctctt tcccttgctc tccacccctc tactactgcc 2340
cctctgtccc cagaagagcc caatgggcct ctgcctggcc acaaacagca gccggaagca 2400
ccagagcctg caggtcaaag accctccatc ttccagtccc tgcttcactg cttgctgggc 2460
agagctcgcc ccacaaagaa gaaaacaacc caggattcca cagatttagt ggataacatg 2520
tcacctgaag agattgaggc ctgggagcag gagcagctga gccagcctgt ctgttttgat 2580
tcctgctgta ttgaccagtc tcccttccag ctggtggagc agacaaccct gcacaagact 2640
cataccctgt tttcactcct tggcctccac ctcgcttacg tgaccagcat ggggaagctc 2700
aggggcgtcc tggccctgga ggagctacag aaggccattg aggggcacac caagtctggg 2760
gtgcagctcc gccctcccct tgccagcttc cggaacacga cttcaactcg aaagagtacc 2820
ggggcacctc catcttctgc agagaactgg aacctgcctg aggacaggcc tggggccact 2880
ggaacagggg atgtgattgc tgcctcccca gagacccctg tgccatctcc ttccccagag 2940
ccccctctct ccctggcccc aggcaaggta gagggcgagt tggaggagct ggagctggtg 3000
gagagtccag ggctggaaga ggagctggcc gacatcttgc agggccccag cctgcgatcc 3060






acagacgagg aggatgagga tgaactgatc ctttgacccc ctcccacgac ctcctcataa 3120 
agaccgtgga gaggcccaaa aaaaaaaaaa aaaaaaaaa 3159 

<210> 90 
<211> 1821 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7505294CB1 

<400> 90 

gctgcctgcc ggtgctcttc gtggctctgg gcatggcctc ggaccccatc ttcacgctgg 60 
cgcccccgct gcattgccac tacggggcct tcccccctaa tgcctctggc tgggagcagc 120 
ctcccaatgc cagcggcgtc agcgtcgcca gcgctgccct agcagccagc gccgccagcc 180 
gtgtcgccac cagtaccgac ccctcgtgca gcggcttcgc cccgccggac ttcaaccatt 240 
gcctcaagga ttgggactat aatggccttc ctgtgctcac caccaacgcc atcggccagt 300 
gggatctggt gtgtgacctg ggctggcagg tgatcctgga gcagatcctc ttcatcttgg 360 
gctttgcctc cggctacctg ttcctgggtt accccgcaga cagatttggc cgtcgcggga 420 
ttgtgctgct gaccttgggg ctggtgggcc cctgtggagt aggaggggct gctgcaggct 480 
cctccacagg cgtcatggcc ctccgattcc tcttgggctt tctgcttgcc ggtgttgacc 540 
tgggtgtcta cctgatgcgc ctggagctgt gcgacccaac ccagaggctt cgggtggccc 600 
tggcagggga gttggtgggg gtgggagggc acttcctgtt cctgggcctg gcccttgtct 660 
ctaaggattg gcgattccta cagcgaatga tcaccgctcc ctgcatcctc ttcctgtttt 720 
atggctggcc tggtttgttc ctggagtccg cacggtggct gatagtgaag cggcagattg 780 
aggaggctca gtctgtgctg aggatcctgg ctgagcgaaa ccggccccat gggcagatgc 840 
tgggggagga ggcccaggag gccctgcagg cttcattgcc catgccattc gccactgcta 900 
ccagcctgtg ggaggaggag ggagcccatc ggacttctac ctgtgctctc tgctggccag 960 
cggcaccgca gccctggcct gtgtcttcct gggggtcacc gtggaccgat ttggccgccg 1020 
gggcatcctt cttctctcca tgacccttac cggcattgct tccctggtcc tgctgggcct 1080 
gtgggattat ctgaacgagg ctgccatcac cactttctct gtccttgggc tcttctcctc 1140 
ccaagctgcc gccatcctca gcaccctcct tgctgctgag gtcatcccca ccactgtccg 1200 
gggccgtggc ctgggcctga tcatggctct aggggcgctt ggaggactga gcggcccggc 1260 
ccagcgcctc cacatgggcg atggagcctt cctgcagcac gtggtgctgg cggcctgcgc 1320
cctcctctgc attctcagca ttatgctgct gccggagacc aagcgcaagc tcctgcccga 1380 
ggtgctccgg gacggggagc tgtgtcgccg gccttccctg ctgcggcagc caccccctac 1440 
ccgctgtgac cacgtcccgc tgcttgccac ccccaaccct gccctctgag cggcctctga 1500 
gtaccctggc gggaggctgg cccacacaga aaggtggcaa gaagatcggg aagactgagt 1560 
agggaaggca gggctgccca gaagtctcag aggcacctca cgccagccat cgcggagagc 1620 
tcagagggcc gtccccaccc tgcctcctcc ctgctgcttt gcattcactt ccttggccag 1680 
agtcagggga cagggagaga gctccacact gtaaccactg ggtctgggct ccatcctgcg 1740 
cccaaagaca tccacccaga cctcattatt tcttgctcta tcattctgtt tcaataaaga 1800 
catttggaat aaacgagcaa a 1821 

<210> 91 

<211> 3526 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505631CB1 

<400> 91 

cacttggcgg ccatggcagc tgtagtatcg gcgactccgg gtcaaggccc ggtcgagtgc 60 
agtaccatgg gcagcaccgg gtatagggca gagacagctt tgtgtcaact ttgctgctga 120
acccctagga cccatcgtta gagacctgca ggactccttt cctcatccca ggctcggagg 180
agagtttgct gggactggtg ggctggtttc ctgctctggg gggcggatca ccttcggggc 240
cgcctcttgg agacaggggc gcctagggaa cgaacaggtt cgcttgagtc acttacccgc 300
cgccgcctaa gacattgtgc caccctcaat ccgacaatcg aagaaatcga tcattcgcac 360
attttcccca ttgacttttc ccatctctgt taacccacga gaatctaatg actggcatct 420
gagaacccag agcctgggac cttagattgc tgtaagcttt ctctggtgct aatatcagca 480
aaaagggtct gttgccgggt acgttcaaga ggaaggtgcc tcgtgaacac atctgctggt 540
gggaaggcct aaagaactgg aaagcccact ctcttggaac caccacacct gtttaaagaa 600
cctaagcacc atttaaagcc actggaaatt tgttgtctag tggttgtggg tgaataaagg 660
agggcagaat ggatgatttc atctccatta gcctgctgtc tctggctatg ttggtgggat 720
gttacgtggc cggaatcatt cccttggctg ttaatttctc agaggaacga ctgaagctgg 780
tgactgtttt gggtgctggc cttctctgtg gaactgctct ggcagtcatc gtgcctgaag 840
gagtacatgc cctttatgaa gatattcttg agggaaaaca ccaccaagca agtgaaacac 900
ataatgtgat tgcatcagac aaagcagcag aaaaatcagt tgtccatgaa catgagcaca 960
gccacgacca cacacagctg catgcctata ttggtgtttc cctcgttctg ggcttcgttt 1020
tcatgttgct ggtggaccag attggtaact cccatgtgca ttctactgac gatccagaag 1080
cagcaaggtc tagcaattcc aaaatcacca ccacgctggg tctggttgtc catgctgcag 1140
ctgatggtgt tgctttggga gcagcagcat ctacttcaca gaccagtgtc cagttaattg 1200
tgtttgtggc aatcatgcta cataaggcac cagctgcttt tggactggtt tccttcttga 1260
tgcatgctgg cttagagcgg aatcgaatca gaaagcactt gctggtcttt gcattggcag 1320
caccagttat gtccatggtg acatacttag gactgagtaa gagcagtaaa gaagcccttt 1380
cagaggtgaa cgccacggga gtggccatgc ttttctctgc cgggacattt ctttatgttg 1440
ccacagtgag gaaagtagca caaataggat acagttgtat gtagtcattg gcaacaattg 1500
catacaattt tactaccaag agaaggtata gtatggaaag tccaaatgac ttccttgatt 1560
ggatgttaac agctgactgg tgtgagactt gaggtttcat ctagtccttc aaaactatat 1620
ggttgcctag attctctctg gaaactgact ttgtcaaata aatagcagat tgtagtgtct 1680
ggtttggttt ggacagtagt gctttctatc atattgttgt gtgcaatggt aatttgttct 1740
actggccaaa gcctctttca gcagtgcctt gccatcatgc ttaaaagttt ggctagtata 1800
tcttgctgga tggagccttg aactccggca aggattgaac catctgactt ccaaatttgc 1860
cttcccctct ggacctcact attaacaagc aaacctttca gggccctctt agctctcaga 1920
agctatgtat gggctttccc agattttaaa gctgctgcct cgagaactac tcatttctct 1980
cctggtcagc agacagaaat agccatacta atctcatagg gctcaaatgc atcttcaggc 2040
agcagggaac caagcagcgt ggcacaggcc ttcttgactg gaggaagagc ttgctggcat 2100
ggtgggcagt attccaggag aggccatgtc cgtgttcact tcttggcaca tttcagttcc 2160
gttttcctct tgtttaaaac tgcctcttta gatgtggatg ccttaatgct gtaacacatt 2220
tgaaaacatt ggcaatactt aagttgctgc catgattaca gatggaatta ttggctacca 2280
aagagacgca attgatgatg agaagcatga ttcttgcttc catataacca aagttaatct 2340
taattgcaat ttgactccgt ttccttggta gggatagact ttcttcagat tccaagtgct 2400
ctcttaaatg gcaaattaag ttaaagaata ctactgctcc attcccctca cttattctcc 2460
agttaattgc ttgtcagttc catttcaaga aagcagtgat gttccaggtt tgattcagtt 2520
ttcctgtgca cactattgcc aaattttttt ttagcaaaga ttctgcactg gaacgtagac 2580
agttggaaac agtactacct acctagaggt tatgtgtttt ctctttctcc ccgctttcac 2640
ctctttcttt cccaattcaa aacagccaag tgagccctgt tctggtattt tgaatcatta 2700
gagaaaagaa agggagtggc tgttttgagt tgtcctttct ttgcagaaag gagaaaatgt 2760
gattgtgttt tttttttacc agcctacttc taagtgtcac tgcctggttt ttctcttttt 2820
caaggattag aactaagagg acacaccagc atcggagtgt attaagcccc tgaaacacat 2880
ggtagctagg gactgaacac aggaaccgta tgacagcagc acaaaccccc aaaggatgtt 2940
cctgccttgt gggcccctga gccccttggg agactgagaa tcatgaccag attcatccag 3000
aactgctgca gtgttaagtg aaaatcctct gtagttgttc tgcagaggaa ccttccttcc 3060
attagaaaat ttctgctcaa tacagaatgg tccacatcac ccaaagtgca ctgttggaga 3120
tgctgtgaaa ttaaaacctc tttgtacctg agacatctag attcacctca ggaggcctga 3180
aggaaatgtg taacttgtgg gaaagaacta gacaaccatt taggaattct ctagatatac 3240
tcagcctaac ccagtggctt aacacaagga gattggcttt gatctttttt tcttgtggca 3300
tcttccagca agttagaagt ctcatgggat aagactgcag ttcccctggt tcaatagctg 3360
gaacagtgat tttaaatgtc cctttttctg gatcccttgt aaacatgaaa tcattccatg 3420
gatggctgcc ttataatttt gtctctttcc actttaattg tgaatggtta aaaaaatgct 3480
gttttctgat attaaatttt tattagtgca taccttaaaa aaaaaa 3526 

<210> 92 

<211> 946 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506561CB1 

<400> 92 

cacggagatt cggcacgagg aggaaaagca gcaaaccaga tggagaagga aggccaaaaa 60 
gataagtaga aagctggacc atgagctcct tgagggcagg gactgaatgg ttactatgtc 120 
cccagggccc agcatgacct tctcctggat tcctcatctt ccttctgtga cctgtgtctc 180 
catcagtttc tcctccggca tctttttctt acaggattct tacctcagga atagatggac 240 
atggcctggc agatgatgca gctgctgctt ctggctttgg tgactgctgc ggggagtgcc 300 
cagcccagga gtgcgcgggc caggacggac ctgctcaatg tctgcatgaa cgccaagcac 360 
cacaagacac agcccagccc cgaggacgag ctgtatggcc agtgcagtcc ctggaagaag 420 
aatgcctgct gcacggccag caccagccag gagctgcaca aggacacctc ccgcctgtac 480 
aactttaact gggatcactg tggtcaacca gagctggcgc aaagagcgca ttctgaacgt 540 
gcccctgtgc aaagaggact gtgagcgctg gtgggaggac tgtcgcacct cctacacctg 600 
caaaagcaac tggcacaaag gctggaattg gacctcaggg attaatgagt gtccggccgg 660 
ggccctctgc agcacctttg agtcctactt ccccactcca gccgcccttt gtgaaggcct 720 
ctggagccac tccttcaagg tcagcaacta tagtcgaggg agctgccgct gcatccagat 780 
gtggtttgac tcagcccagg gcaaccccaa tgaggaggtg gccaagttct atgctgcggc 840 
catgaatgct ggggccccgt ctcgtgggat tattgattcc tgatccaaga agggtcctct 900 
ggggttcttc caacaaccta ttctaataga caaatccaca tgaaaa 946 

<210> 93 

<211> 2737 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510733CB1 

<400> 93 

agcgaacagg gaatgacagt tccaccagaa gacgattaag ccacagcctc taattggaac 60 
ggcatttgta cagtcagaga ctcttaccag acatctccag gaatctgtga gccattgtca 120 
aaacgtccat tttcatctgg ctgtgaaagt gaggaccaca acaggtaggt attggtagaa 180 
acaggagtcc tcagagaagc cccaagatgc agcctgaggg agcagaaaag ggaaaaagct 240 
tcaagcagag actggtcttg aagagcagct tagcgaaaga aaccctctct gagttcttgg 300 
gcacgttcat cttgattgtc cttggatgtg gctgtgttgc ccaagctatt ctcagtcgag 360 
gacgttttgg aggggtcatc actatcaatg ttggattttc aatggcagtt gcaatggcca 420 
tttatgtggc tggcggtgtc tctgatggac ttatgtcctt tgctggtgga aaactgctga 480 
tcgtgggaga aaatgcaaca gcacacattt ttgcaacata cccagctccg tatctatctc 540 
tggcgaacgc atttgcagat caagtggtgg ccaccatgat actcctcata atcgtctttg 600 
ccatttttga ctccagaaac ttgggagccc ccagaggcct agagcccatt gccatcggcc 660 
tcctgattat tgtcattgct tcctccctgg gactgaacag tggctgtgcc atgaacccag 720 
ctcgagacct gagtcccaga cttttcactg ccttggcagg ctgggggttt gaagtcttca 780 
gagctggaaa caacttctgg tggattcctg tagtgggccc tttggttggt gctgtcattg 840 
gaggcctcat ctatgttctt gtcattgaaa tccaccatcc agagcctgac tcagtcttta 900 
aggcagaaca atctgaggac aaaccagaga aatatgaact cagtgtcatc atgtagtggc 960 
atgctcagct ctggatttgc agtcagtttg ggattctctt cagaaagatg gcatctaagt 1020
gtctgtgttc ttgtaagcct gaggtggaat ccacccagtt ttgtctgcta gccatatggg 1080 
acatctaatt ggaaaagcat ctgcataaaa gtttggaaac aatgaccact tctctaccat 1140 
tgtcccccac ccccaccccc cagaataacg ctgactgtcc cctgaaacag ccttctctcc 1200 
tgccctgttt atttcatcct cgatgggaat tcttgctagg taagcactaa taactcggca 1260 
tcttgacgat agtcccattt gggtggtttc agctgcacta tctgtatgaa atggtgtcac 1320 
caaaaccctt ttcttcagta tcgacaaaga ttacattctg agtaccaacc aaaccctaaa 1380 
ttgaaagaca aaactatggt ttcagtcaac atattcatga attagggagc taatgggtta 1440 
agcttccagt tcccgctatg ctactggatt tgtataaata ctgatattct ccaaacctag 1500 
tggtgtaggg agcaagagaa tgcagctgga aggcacaagg ggaggacatt gtggcattca 1560 
gaaactgcag gagacaagat gaatttgaga agccaaatgg aatttttaat ggaaaccatt 1620 
tatcagatta atctcttgct ctcctgcatt ttagaggaca ccaattaatt tcctggtctt 1680 
tagtatataa taacctaaaa taccattgta acctcagtca tgaaaaatac atcactctgt 1740 
ctttttagct caaatgtatt ttcctaattg cccacttgag aacagacatt tgacaagtta 1800 
tatcaacgac tgtgcttgtc cattatttta cacatgccct agaagccaaa actgaaagcc 1860 
actggatcct ggtctagctg aatcttcaga gtgggaggtc tccaaaaaga tattacctta 1920 
ttgggcttaa caattcacaa ggcactttca cacccattat ctaatttaat cctcataatg 1980 
actatgtgag gcaaatgcca cattgcccat ttttcagata aagaaacaaa atcttaggga 2040 
agataagttg agttgtccaa gagcacactg aaagttgaat gttatctaat gcattcctct 2100 
acctttcaga agatcagtag ctggctgaga atctttgcca aatcttcctt gctagccaga 2160 
agtggaattg gcagcttcta gaatatgtac acctctggac aaaatgttcc tcaatcttaa 2220 
gatacaaaga ccctcattgt ctgggtctat tcccacactt actgagtaca gatgaaggaa 2280 
agtggtagca atttaatcat aactttcatt tgctgaaaaa cattatgaga aggcctccct 2340 
tcctaagcca cctctggtct tgctaagtct tgatcttgct tcctgccagc accaaacatt 2400 
acattcaggg gatttcctct ggctcagtct tttccccttg aagttctcta atagatgtta 2460 
cttttgacaa aagatcgcct atgagttaca agcaccaggg gatgctctac atcaagggat 2520 
gcaccttcag tcaaactgtc aaaaagccca gaattcccaa aggcattagg tttcccaact 2580 
gctttgtgct gatatcagaa cagcagaaat taaatgtgaa atgtttctga tgacttatgt 2640 
tctacaatct atggacatac gggatttttt ttttcttgct ttgaagctac ctggatattt 2700 
cctatttgaa ataaaattgt tcggtcattg ttgaaaa 2737 

<210> 94 
<211> 2821 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7510734CB1 

<400> 94 

agcgaacagg gaatgacagt tccaccagaa gacgattaag ccacagcctc taattggaac 60 
ggcatttgta cagtcagaga ctcttaccag acatctccag gaatctgtga gccattgtca 120 
aaacgtccat tttcatctgg ctgtgaaagt gaggaccaca acaggtaggt attggtagaa 180 
acaggagtcc tcagagaagc cccaagatgc agcctgaggg agcagaaaag ggaaaaagct 240 
tcaagcagag actggtcttg aagagcagct tagcgaaaga aaccctctct gagttcttgg 300 
gcacgttcat cttgattgtc cttggatgtg gctgtgttgc ccaagctatt ctcagtcgag 360 
gacgttttgg aggggtcatc actatcaatg ttggattttc aatggcagtt gcaatggcca 420 
tttatgtggc tggcggtgtc tctggtggtc acatcaaccc agctgtgtct ttagcaatgt 480 
gtctctttgg acggatgaaa tggttcaaat tgccatttta tgtgggagcc cagttcttgg 540 
gagcctttgt gggggctgca accgtctttg gcatttacta tgatggactt atgtcctttg 600 
ctggtggaaa actgctgatc gtgggagaaa atgcaacagc acacattttt gcaacatacc 660 
cagctccgta tctatctctg gcgaacgcat ttgcagatca aaaacttggg agcccccaga 720 
ggcctagagc ccattgccat cggcctcctg attattgtca ttgcttcctc cctgggactg 780 
aacagtggct gtgccatgaa cccagctcga gacctgagtc ccagactttt cactgccttg 840 
gcaggctggg ggtttgaagt cttcagagct ggaaacaact tctggtggat tcctgtagtg 900 
ggccctttgg ttggtgctgt cattggaggc ctcatctatg ttcttgtcat tgaaatccac 960
catccagagc ctgactcagt ctttaaggca gaacaatctg aggacaaacc agagaaatat 1020
gaactcagtg tcatcatgta gtggcatgct cagctctgga tttgcagtca gtttgggatt 1080
ctcttcagaa agatggcatc taagtgtctg tgttcttgta agcctgaggt ggaatccacc 1140
cagttttgtc tgctagccat atgggacatc taattggaaa agcatctgca taaaagtttg 1200
gaaacaatga ccacttctct accattgtcc cccaccccca ccccccagaa taacgctgac 1260
tgtcccctga aacagccttc tctcctgccc tgtttatttc atcctcgatg ggaattcttg 1320
ctaggtaagc actaataact cggcatcttg acgatagtcc catttgggtg gtttcagctg 1380
cactatctgt atgaaatggt gtcaccaaaa cccttttctt cagtatcgac aaagattaca 1440
ttctgagtac caaccaaacc ctaaattgaa agacaaaact atggtttcag tcaacatatt 1500
catgaattag ggagctaatg ggttaagctt ccagttcccg ctatgctact ggatttgtat 1560
aaatactgat attctccaaa cctagtggtg tagggagcaa gagaatgcag ctggaaggca 1620
caaggggagg acattgtggc attcagaaac tgcaggagac aagatgaatt tgagaagcca 1680
aatggaattt ttaatggaaa ccatttatca gattaatctc ttgctctcct gcattttaga 1740
ggacaccaat taatttcctg gtctttagta tataataacc taaaatacca ttgtaacctc 1800
agtcatgaaa aatacatcac tctgtctttt tagctcaaat gtattttcct aattgcccac 1860
ttgagaacag acatttgaca agttatatca acgactgtgc ttgtccatta ttttacacat 1920
gccctagaag ccaaaactga aagccactgg atcctggtct agctgaatct tcagagtggg 1980
aggtctccaa aaagatatta ccttattggg cttaacaatt cacaaggcac tttcacaccc 2040
attatctaat ttaatcctca taatgactat gtgaggcaaa tgccacattg cccatttttc 2100
agataaagaa acaaaatctt agggaagata agttgagttg tccaagagca cactgaaagt 2160
tgaatgttat ctaatgcatt cctctacctt tcagaagatc agtagctggc tgagaatctt 2220
tgccaaatct tccttgctag ccagaagtgg aattggcagc ttctagaata tgtacacctc 2280
tggacaaaat gttcctcaat cttaagatac aaagaccctc attgtctggg tctattccca 2340
cacttactga gtacagatga aggaaagtgg tagcaattta atcataactt tcatttgctg 2400
aaaaacatta tgagaaggcc tcccttccta agccacctct ggtcttgcta agtcttgatc 2460
ttgcttcctg ccagcaccaa acattacatt caggggattt cctctggctc agtcttttcc 2520
ccttgaagtt ctctaataga tgttactttt gacaaaagat cgcctatgag ttacaagcac 2580
caggggatgc tctacatcaa gggatgcacc ttcagtcaaa ctgtcaaaaa gcccagaatt 2640
cccaaaggca ttaggtttcc caactgcttt gtgctgatat cagaacagca gaaattaaat 2700
gtgaaatgtt tctgatgact tatgttctac aatctatgga catacgggat tttttttttc 2760
ttgctttgaa gctacctgga tatttcctat ttgaaataaa attgttcggt cattgttgaa 2820

2821 



<210> 95 
<211> 3583 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7503977CB1 



<400> 95 

ctgggcttcg tgccccgacc tggtgtgtca gtgctggggg gatcgggggg ccccgtcctc 60
cagacctggc tgcaggacct gctgcgtcgt gggctggtgc gggctgccca gagcacagga 120
gcctggattg tcactggggg tctgcacacg ggcatcggcc ggcatgttgg tgtggctgta 180
cgggaccatc agatggccag cactgggggc accaaggtgg tggccatggg tgtggccccc 240
tggg^tgtgg tccggaatag agacaccctc atcaacccca agggctcgtt ccctgcgagg 300
taccggtggc gcggtgaccc ggaggacggg gtccagtttc ccctggacta caactactcg 360
gccttcttcc tggtggacga cggcacacac ggctgcctgg ggggcgagaa ccgcttccgc 420
ttgcgcctgg agtcctacat ctcacagcag aagacgggcg tgggagggac tggaattgac 480
atccctgtcc tgctcctcct gattgatggt gatgagaaga tgttgacgcg aatagagaac 540
gccacccagg ctcagctccc atgtctcctc gtggctggct cagggggagc tgcggactgc 600
ctggcggaga ccctggaaga cactctggcc ccagggagtg ggggagccag gcaaggcgaa 660
gcccgagatc gaatcaggcg tttctttccc aaaggggacc ttgaggtcct gcaggcccag 720
gtggagagga ttatgacccg gaaggagctc ctgacagtct attcttctga ggatgggtct 780
gaggaattcg agaccatagt tttgaaggcc cttgtgaagg tccttccatc tcgaagcttc 840 
cctcatggac gccctgctga atgaccggcc tgagttcgtg cgcttgctca tttcccacgg 900 
cctcagcctg ggccacttcc tgaccccgat gcgcctggcc caactctaca gcgcggcgcc 960 
ctccaactcg ctcatccgca accttttgga ccaggcgtcc cacagcgcag gcaccaaagc 1020 
cccagcccta aaagggggag ctgcggagct ccggccccct gacgtggggc atgtgctgag 1080 
gatgctgctg gggaagatgt gcgcgccgag gtacccctcc gggggcgcct gggaccctca 1140 
cccaggccag ggcttcgggg agagcatgta tctgctctcg gacaaggcca cctcgccgct 1200 
ctcgctggat gctggcctcg ggcaggcccc ctggagcgac ctgcttcttt gggcactgtt 1260 
gctgaacagg gcacagatgg ccatgtactt ctgggagatg ggttccaatg cagtttcctc 1320 
agctcttggg gcctgtttgc tgctccgggt gatggcacgc ctggagcctg acgctgagga 1380 
ggcagcacgg aggaaagacc tggcgttcaa gtttgagggg atgggcgttg acctctttgg 1440 
cgagtgctat cgcagcagtg aggtgagggc tgcccgcctc ctcctccgtc gctgcccgct 1500 
ctggggggat gccacttgcc tccagctggc catgcaagct gacgcccgtg ccttctttgc 1560 
ccaggatggg gtacagtctc tgctgacaca gaagtggtgg ggagatatgg ccagcactac 1620 
acccatctgg gccctggttc tcgccttctt ttgccctcca ctcatctaca cccgcctcat 1680 
caccttcagg aaatcagaag aggagcccac acgggaggag ctagagtttg acatggatag 1740 
tgtcattaat ggggaagggc ctgtcgggac ggcggaccca gccgagaaga cgccgctggg 1800 
ggtcccgcgc cagtcgggcc gtccgggttg ctgcgggggc cgctgcgggg ggcgccggtg 1860 
cctacgccgc tggttccact tctggggcgc gccggtgacc atcttcatgg gcaacgtggt 1920 
cagctacctg ctgttcctgc tgcttttctc gcgggtgctg ctcgtggatt tccagccggc 1980 
gccgcccggc tccctggagc tgctgctcta tttctgggct ttcacgctgc tgtgcgagga 2040 
actgcgccag ggcctgagcg gaggcggggg cagcctcgcc agcgggggcc ccgggcctgg 2100 
ccatgcctca ctgagccagc gcctgcgcct ctacctcgcc gacagctgga accagtgcga 2160 
cctagtggct ctcacctgct tcctcctggg cgtgggctgc cggctgaccc cgggtttgta 2220 
ccacctgggc cgcactgtcc tctgcatcga cttcatggtt ttcacggtgc ggctgcttca 2280 
catcttcacg gtcaacaaac agctggggcc caagatcgtc atcgtgagca agatgatgaa 2340 
ggacgtgttc ttcttcctct tcttcctcgg cgtgtggctg gtagcctatg gcgtggccac 2400 
ggaggggctc ctgaggccac gggacagtga cttcccaagt atcctgcgcc gcgtcttcta 2460 
ccgtccctac ctgcagatct tcgggcagat tccccaggag gacatggacg tggccctcat 2520 
ggagcacagc aactgctcgt cggagcccgg cttctgggca caccctcctg gggcccaggc 2580 
gggcacctgc gtctcccagt atgccaactg gctggtggtg ctgctcctcg tcatcttcct 2640 
gctcgtggcc aacatcctgc tggtcaactt gctcattgcc atgttcagtt acacattcgg 2700 
caaagtacag ggcaacagcg atctctactg gaaggcgcag cgttaccgcc tcatccggga 2760 
attccactct cggcccgcgc tggccccgcc ctttatcgtc atctcccact tgcgcctcct 2820 
gctcaggcaa ttgtgcaggc gaccccggag cccccagccg tcctccccgg ccctcgagca 2880 
tttccgggtt tacctttcta aggaagccga gcggaagctg ctaacgtggg aatcggtgca 2940 
taaggagaac tttctgctgg cacgcgctag ggacaagcgg gagagcgact ccgagcgtct 3000 
gaagcgcacg tcccagaagg tggacttggc actgaaacag ctgggacaca tccgcgagta 3060 
cgaacagcgc ctgaaagtgc tggagcggga ggtccagcag tgtagccgcg tcctggggtg 3120 
ggtggccgag gccctgagcc gctctgcctt gctgccccca ggtgggccgc caccccctga 3180 
cctgcctggg tccaaagact gagccctgct ggcggacttc aaggagaagc ccccacaggg 3240 
gattttgctc ctagagtaag gctcatctgg gcctcggccc ccgcacctgg tggccttgtc 3300 
cttgaggtga gccccatgtc catctgggcc actgtcagga ccacctttgg gagtgtcatc 3360 
cttacaaacc acagcatgcc cggctcctcc cagaaccagt cccagcctgg gaggatcaag 3420 
gcctggatcc cgggccgtta tccatctgga ggctgcaggg tccttggggt aacagggacc 3480 
acagacccct caccactcac agattcctca cactggggaa ataaagccat ttcagaggaa 3540 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aag 3583 



<210> 96 
<211> 2125 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7505084CB1



<400> 96 

gagaacttta cgcctggatc tcatctaact gacacagaaa ccctgtaagg atccagaggt 60
ctcgttcagg accatggaga gcggcaccag cagccctcag cctccacagt tagatcccct 120
ggatgcgttt ccccagaagg gcttggagcc tggggacatc gcggtgctag ttctgtactt 180
cctctttgtc ctggctgttg gactatggtc cacagtgaag accaaaagag acacagtgaa 240
aggctacttc ctggctggag gggacatggt gtggtggcca gtgggtgcat ccttgtttgc 300
cagcaatgtt ggaagtggac atttcattgg cctggcaggg tcaggtgctg ctacgggcat 360
ttctgtatca gcttatgaac ttaatggctt gttttctgtg ctgatgttgg cctggatctt 420
cctacccatc tacattgctg gtcaggtcac cacgatgcca gaatacctac ggaagcgctt 480
cggtggcatc agaatcccca tcatcctggc tgtactctac ctatttatct acatcttcac 540
caagatctcg gtagacatgt atgcaggtgc catcttcatc cagcagtctt tgcacctgga 600
tctgtacctg gccatagttg ggctactggc catcactgct gtatacacgg ttgctggtgg 660
cctggctgct gtgatctaca cggatgccct gcagacgctg atcatgctta taggagcgct 720
caccttgatg ggctacagtt tcgccgcggt tggtgggatg gaaggactga aggagaagta 780
cttcttggcc ctggctagca accggagtga gaacagcagc tgcgggctgc cccgggaaga 840
tgccttccat attttccgag atccgctgac atctgatctc ccgtggccgg gggtcctatt 900
tggaatgtcc atcccatccc tctggtactg gtgcacggat caggtgattg tccagcggac 960
tctggctgcc aagaacctgt cccatgccaa aggaggtgct ctgatggctg catacctgaa 1020
ggtgctgccc ctcttcataa tggtgttccc tgggatggtc agccgcatcc tcttcccaga 1080
tcaagtggcc tgtgcagatc cagagatctg ccagaagatc tgcagcaacc cctcaggctg 1140
ttcggacatc gcgtatccca aactcgtgct ggaactcctg cccacagtgc cagcaccatc 1200
ttcaccatgg acctctggaa tcacctccgg cctcgggcat ctgagaagga gctcatgatt 1260
gtgggcaggg tgtttgtgct gctgctggtc ctggtctcca tcctctggat ccctgtggtc 1320
caggccagcc agggcggcca gctcttcatc tatatccagt ccatcagctc ctacctgcag 1380
ccgcctgtgg cggtggtctt catcatggga tgtttctgga agaggaccaa tgaaaagggt 1440
gccttctggg gcctgatctc gggcctgctc ctgggcttgg ttaggctggt cctggacttt 1500
atttacgtgc agcctcgatg cgaccagcca gatgagcgcc cggtcctggt gaagagcatt 1560
cactacctct acttctccat gatcctgtcc acggtcaccc tcatcactgt ctccaccgtg 1620
agctggttca cagagccacc ctccaaggag atggtcagcc acctgacctg gtttactcgt 1680
cacgaccccg tggtccagaa ggaacaagca ccaccagcag ctcccttgtc tcttaccctc 1740
tctcagaacg ggatgccaga ggccagcagc agcagcagcg tccagttcga gatggttcaa 1800
gaaaacacgt ctaaaaccca cagctgtgac atgaccccaa agcagtccaa agtggtgaag 1860
gccatcctgt ggctctgtgg aatacaggag aagggcaagg aagagctccc ggccagagca 1920
gaagccatca tagtttccct ggaagaaaac cccttggtga agaccctcct ggacgtcaac 1980
ctcattttct gcgtgagctg cgccatcttt atctggggct attttgctta gtgtggggtg 2040
aacccagggg tccaaactct gtttctcttc agtgctccat ttttttaatg aaagaaaaaa 2100
taataaagct tttgtttacc aaaaa 2125

<210> 97

<211> 1517

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7506950CB1

<400> 97

gaagaaaaca ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa 60
catctacaac atgcagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt 120
gctggctaac atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct 180
tgacagactt ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac 240
tgaagtcttc actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga 300
atatacaatt gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg 360
tcctatgaat atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac 420
cttttttcac aatgggaaaa aatcagtagc tcataatatg acaatgccaa ataagttgct 480
tcgaattcag gatgatggga ctctgctgta taccatgagg agtaacaact gtcctaacaa 540
tgacaactct aagcatcagt gctcggaatt ctctccccaa agtggcttat gcaactgcca 600
tggactggtt tattgctgtt tgttatgcat ttgtgttctc tgccctaatt gaatttgcaa 660
ctgttaatta cttcaccaaa agaggatggg cttgggatgg gaagagtgta gtaaatgaca 720
agaaaaaaga aaaggcttcc gttatgatac agaacaacgc ttatgcagtg gctgttgcca 780
attatgcccc gaatctttca aaagatccag ttctctccac catctccaag agtgcaacca 840
cgccagaacc caacaagaag ccagaaaaca agccagctga agcaaagaaa actttcaaca 900
gtgttagcaa aattgacaga atgtccagaa tagtttttcc agttttgttt ggtaccttta 960
atttagttta ctgggctaca tatttaaaca gagaacctgt attaggggtc agtccttgaa 1020
ttgagaccca tgttatcttt gggatgtata gcaacattaa atttggtttg ttttgctatg 1080
tacagtctga ctaataactg ctaatttgtg atccaacatg tacagtatgt atatagtgac 1140
atagcttacc agtagacctt taatggagac atgcatttgc taactcatgg aactgcagac 1200
agaaagcact ccatgcgaaa acagccattg ccttttttaa agatttaccc taggacctga 1260
tttaaagtga atttcaaatg acctgattaa tttcctattc ttccaaatga gatgaaaatg 1320
gggatcctgt acaacccttt gtggacactt ttggtttagc tcttaagtag gggtattttc 1380
tactgttgcc taatatgatg gaagtaacat tgtcattcta gatgaatctt tgaagtacca 1440
acattgttct ggaatcagct cggtccagat gctcaaggtc cctgtaatgt attggaagct 1500
ggtaccctaa gaaaaca 1517

<210> 98

<211> 1694

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7506951CB1

<400> 98

gaagaaaaca ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa 60 
catctacaac atgcagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt 120 
gctggctaac atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct 180 
tgacagactt ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac 240 
tgaagtcttc actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga 300 
atatacaatt gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg 360 
tcctatgaat atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac 420 
cttttttcac aatgggaaga aatcagtagc tcataatatg acaatgccaa ataagttgct 480 
tcgaattcag gatgatggga ctctgctgta taccatgagg cttacagttc aagctgaatg 540 
cccaatgcac ttggaggatt tcccaatgga tgctcattca tgtcctctga aatttggcag 600 
ctatgcatat acaacttcag aggtcactta tatttggact tacaatgcat ctgattcagt 660 
acaggttgct cctgatggct ctaggttaaa tcaatatgac ctgctgggcc aatcaatcgg 720 
aaaggagaca attaaatcca gtacaggtga atatactgta atgacagctc atttccacct 780 
gaaaagaaaa attgggtatt ttgtgattca aacctatctg ccttgcatca tgactgtcat 840 
tctctcccaa gtttcattct ggcttaacag agaatctgtg cctgcaagaa ctgtgtttga 900 
aaaaagaaaa ggcttccgtt atgatacaga acaacgctta tgcagtggct gttgccaatt 960 
atgccccgaa tctttcaaaa gatccagttc tctccaccat ctccaagagt gcaaccacgc 1020 
cagaacccaa caagaagcca gaaaacaagc cagctgaagc aaagaaaact ttcaacagtg 1080 
ttagcaaaat tgacagaatg tccagaatag tttttccagt tttgtttggt acctttaatt 1140 
tagtttactg ggctacatat ttaaacagag aacctgtatt aggggtcagt ccttgaattg 1200 
agacccatgt tatctttggg atgtatagca acattaaatt tggtttgttt tgctatgtac 1260 
agtctgacta ataactgcta atttgtgatc caacatgtac agtatgtata tagtgacata 1320 
gcttaccagt agacctttaa tggagacatg catttgctaa ctcatggaac tgcagacaga 1380 
aagcactcca tgcgaaaaca gccattgcct tttttaaaga tttaccctag gacctgattt 1440 
aaagtgaatt tcaaatgacc tgattaattt cctattcttc caaatgagat gaaaatgggg 1500 
atcctgtaca accctttgtg gacacttttg gtttagctct taagtagggg tattttctac 1560
tgttgcctaa tatgatggaa gtaacattgt cattctagat gaatctttga agtaccaaca 1620
ttgttctgga atcagctcgg tccagatgct caaggtccct gtaatgtatt ggaagctggt 1680
accctaagaa aaca 1694



<210> 99 

<211> 1102 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506954CB1 



<400> 99 

ctctctctct ctctctctct ctctctctct ctctctctct ctcccaagtt tcctatctcg 60
tcaagatcag ggcaaaagaa gaaaacaccg aattctgctt gccgtttcag agcggcggtg 120
atgaagacaa aattgaacat ctacaacatg cagttcctgc tttttgtttt cttggtgtgg 180
gaccctgcca ggttggtgct ggctaacatc caagaagatg aggctaaaaa taacattacc 240
atctttacga gaattcttga cagacttctg gatggttacg ataatcggct tagaccagga 300
ctgggagaaa aaagaaaagg cttccgttat gatacagaac aacgcttatg cagtggctgt 360
tgccaattat gccccgaatc tttcaaaaga tccagttctc tccaccatct ccaagagtgc 420
aaccacgcca gaacccaaca agaagccaga aaacaagcca gctgaagcaa agaaaacttt 480
caacagtgtt agcaaaattg acagaatgtc cagaatagtt tttccagttt tgtttggtac 540
ctttaattta gtttactggg ctacatattt aaacagagaa cctgtattag gggtcagtcc 600
ttgaattgag acccatgtta tctttgggat gtatagcaac attaaatttg gtttgttttg 660
ctatgtacag tctgactaat aactgctaat ttgtgatcca acatgtacag tatgtatata 720
gtgacatagc ttaccagtag acctttaatg gagacatgca tttgctaact catggaactg 780
cagacagaaa gcactccatg cgaaaacagc cattgccttt tttaaagatt taccctagga 840
cctgatttaa agtgaatttc aaatgacctg attaatttcc tattcttcca aatgagatga 900
aaatggggat cctgtacaac cctttgtgga cacttttggt ttagctctta agtaggggta 960
ttttctactg ttgcctaata tgatggaagt aacattgtca ttctagatga atctttgaag 1020
taccaacatt gttctggaat cagctcggtc cagatgctca aggtccctgt aatgtattgg 1080
aagctggtac cctaagaaaa ca 1102



<210> 100 

<211> 1744 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506956CB1 



<400> 100 

gaagaaaaca ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa 60
catctacaac atgcagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt 120
gctggctaac atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct 180
tgacagactt ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac 240
tgaagtcttc actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga 300
atatacaatt gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg 360
tcctatgaat atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac 420
cttttttcac aatgggaaga aatcagtagc tcataatatg acaatgccaa ataagttgct 480
tcgaattcag gatgatggga ctctgctgta taccatgagg cttacagttc aagctgaatg 540
cccaatgcac ttggaggatt tcccaatgga tgctcattca tgtcctctga aatttggcag 600
ctatgcatat acaacttcag aggtcactta tatttggact tacaatgcat ctgattcagt 660
acaggttgct cctgatggct ctaggttaaa tcaatatgac ctgctgggcc aatcaatcgg 720
aaaggagaca attaaatcca gtacaggagt aacaactgtc ctaacaatga caactctaag 780
catcagtgct cggaattctc tccccaaagt ggcttatgca actgccatgg actggtttat 840
tgctgtttgt tatgcatttg tgttctctgc cctaattgaa tttgcaactg ttaattactt 900
caccaaaaga ggatgggctt gggatgggaa gagtgtagta aatgacaaga aaaaagaaaa 960
ggcttccgtt atgatacaga acaacgctta tgcagtggct gttgccaatt atgccccgaa 1020
tctttcaaaa gatccagttc tctccaccat ctccaagagt gcaaccacgc cagaacccaa 1080
caagaagcca gaaaacaagc cagctgaagc aaagaaaact ttcaacagtg ttagcaaaat 1140
tgacagaatg tccagaatag tttttccagt tttgtttggt acctttaatt tagtttactg 1200
ggctacatat ttaaacagag aacctgtatt aggggtcagt ccttgaattg agacccatgt 1260
tatctttggg atgtatagca acattaaatt tggtttgttt tgctatgtac agtctgacta 1320
ataactgcta atttgtgatc caacatgtac agtatgtata tagtgacata gcttaccagt 1380
agacctttaa tggagacatg catttgctaa ctcatggaac tgcagacaga aagcactcca 1440
tgcgaaaaca gccattgcct tttttaaaga tttaccctag gacctgattt aaagtgaatt 1500
tcaaatgacc tgattaattt cctattcttc caaatgagat gaaaatgggg atcctgtaca 1560
accctttgtg gacacttttg gtttagctct taagtagggg tattttctac tgttgcctaa 1620
tatgatggaa gtaacattgt cattctagat gaatctttga agtaccaaca ttgttctgga 1680
atcagctcgg tccagatgct caaggtccct gtaatgtatt ggaagctggt accctaagaa 1740
aaca 1744



<210> 101 

<211> 1753 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506959CB1 

<400> 101 

gaagaaaaca ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa 60 
catctacaac atgcagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt 120 
gctggctaac atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct 180 
tgacagactt ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac 240 
tgaagtcttc actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga 300 
atatacaatt gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg 360 
tcctatgaat atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac 420 
cttttttcac aatgggaaga aatcagtagc tcataatatg acaatgccaa ataagttgct 480 
tcgaattcag gatgatggga ctctgctgta taccatgagg cttacagttc aagctgaatg 540 
cccaatgcac ttggaggatt tcccaatgga tgctcattca tgtcctctga aatttggcag 600 
ctgtgaatat actgtaatga cagctcattt ccacctgaaa agaaaaattg ggtattttgt 660
gattcaaacc tatctgcctt gcatcatgac tgtcattctc tcccaagttt cattctggct 720
taacagagaa tctgtgcctg caagaactgt gtttggagta acaactgtcc taacaatgac 780
aactctaagc atcagtgctc ggaattctct ccccaaagtg gcttatgcaa ctgccatgga 840
ctggtttatt gctgtttgtt atgcatttgt gttctctgcc ctaattgaat ttgcaactgt 900
taattacttc accaaaagag gatgggcttg ggatgggaag agtgtagtaa atgacaagaa 960
aaaagaaaag gcttccgtta tgatacagaa caacgcttat gcagtggctg ttgccaatta 1020
tgccccgaat ctttcaaaag atccagttct ctccaccatc tccaagagtg caaccacgcc 1080
agaacccaac aagaagccag aaaacaagcc agctgaagca aagaaaactt tcaacagtgt 1140
tagcaaaatt gacagaatgt ccagaatagt ttttccagtt ttgtttggta cctttaattt 1200 
agtttactgg gctacatatt taaacagaga acctgtatta ggggtcagtc cttgaattga 1260 
gacccatgtt atctttggga tgtatagcaa cattaaattt ggtttgtttt gctatgtaca 1320 
gtctgactaa taactgctaa tttgtgatcc aacatgtaca gtatgtatat agtgacatag 1380 
cttaccagta gacctttaat ggagacatgc atttgctaac tcatggaact gcagacagaa 1440
agcactccat gcgaaaacag ccattgcctt ttttaaagat ttaccctagg acctgattta 1500 
aagtgaattt caaatgacct gattaatttc ctattcttcc aaatgagatg aaaatgggga 1560 
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tcctgtacaa ccctttgtgg acacttttgg tttagctctt aagtaggggt attttctact 1620 
gttgcctaat atgatggaag taacattgtc attctagatg aatctttgaa gtaccaacat 1680 
tgttctggaa tcagctcggt ccagatgctc aaggtccctg taatgtattg gaagctggta 1740 
ccctaagaaa aca 1753 

<210> 102 

<211> 1609 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506960CB1 

<400> 102 

ctctctctct ctctctctct ctctctctct ctctctctct ctcccaagtt tcctatctcg 60 
tcaagatcag ggcaaaagaa gaaaacaccg aattctgctt gccgtttcag agcggcggtg 120 
atgaagacaa aattgaacat ctacaacatg cagttcctgc tttttgtttt cttggtgtgg 180 
gaccctgcca ggttggtgct ggctaacatc caagaagatg aggctaaaaa taacattacc 240 
atctttacga gaattcttga cagacttctg gatggttacg ataatcggct tagaccagga 300 
ctgggaggaa tatacaattg atgttttctt tcgacaaaaa tggaaagatg aacgtttaaa 360 
atttaaaggt cctatgaata tccttcgact aaacaattta atggctagca aaatctggac 420 
tccagatacc ttttttcaca atgggaagaa atcagtagct cataatatga caatgccaaa 480 
taagttgctt cgaattcagg atgatgggac tctgctgtat accatgaggc ttacagttca 540 
agctgaatgc ccaatgcact tggaggattt cccaatggat gctcattcat gtcctctgaa 600
atttggcagc tgagtaacaa ctgtcctaac aatgacaact ctaagcatca gtgctcggaa 660 
ttctctcccc aaagtggctt atgcaactgc catggactgg tttattgctg tttgttatgc 720 
atttgtgttc tctgccctaa ttgaatttgc aactgttaat tacttcacca aaagaggatg 780 
ggcttgggat gggaagagtg tagtaaatga caagaaaaaa gaaaaggctt ccgttatgat 840 
acagaacaac gcttatgcag tggctgttgc caattatgcc ccgaatcttt caaaagatcc 900
agttctctcc accatctcca agagtgcaac cacgccagaa cccaacaaga agccagaaaa 960
caagccagct gaagcaaaga aaactttcaa cagtgttagc aaaattgaca gaatgtccag 1020 
aatagttttt ccagttttgt ttggtacctt taatttagtt tactgggcta catatttaaa 1080 
cagagaacct gtattagggg tcagtccttg aattgagacc catgttatct ttgggatgta 1140 
tagcaacatt aaatttggtt tgttttgcta tgtacagtct gactaataac tgctaatttg 1200 
tgatccaaca tgtacagtat gtatatagtg acatagctta ccagtagacc tttaatggag 1260 
acatgcattt gctaactcat ggaactgcag acagaaagca ctccatgcga aaacagccat 1320
tgcctttttt aaagatttac cctaggacct gatttaaagt gaatttcaaa tgacctgatt 1380 
aatttcctat tcttccaaat gagatgaaaa tggggatcct gtacaaccct ttgtggacac 1440 
ttttggttta gctcttaagt aggggtattt tctactgttg cctaatatga tggaagtaac 1500 
attgtcattc tagatgaatc tttgaagtac caacattgtt ctggaatcag ctcggtccag 1560 
atgctcaagg tccctgtaat gtattggaag ctggtaccct aagaaaaca 1609

<210> 103 

<211> 1930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510540CB1 

<400> 103 

attgacagga ctcccaacta gtacaatgac agaagataag gtcactggga ccctggtttt 60 
cactgtcatc actgctgtgc tgggttcctt ccagtttgga tatgacattg gtgtgatcaa 120 
tgcacctcaa cagaatcaaa gccatgttag tagcaaacat tctgtcatta gttggagctc 180
tcttgatggg gttttcaaaa ttgggaccat ctcatatact tataattgct ggaagaagca 240
tatcaggact atattgtggg ctaatttcag gcctggttcc tatgtatatc ggtgaaattg 300
ctccaaccgc tctcagggga gcacttggca cttttcatca gctggccatc gtcacgggca 360
ttcttattag tcagattatt ggtcttgaat ttatcttggg caattatgat ctgtggcaca 420
tcctgcttgg cctgtctggt gtgcgagcca tccttcagtc tctgctactc tttttctgtc 480
cagaaagccc cagatacctt tacatcaagt tagatgagga agtcaaagca aaacaaagct 540
tgaaaagact cagaggatat gatgatgtca ccaaagatat taatgaaatg agaaaagaaa 600
gagaagaagc atcgagtgag cagaaagtct ctataattca gctcttcacc aattccagct 660
accgacagcc tattctagtg gcactgatgc tgcatgtggc tcagcaattt tccggaatca 720
atggcatttt ttactactca accagcattt ttcagacggc tggtatcagc aaacctgttt 780
atgcaaccat tggagttggc gctgtaaaca tggttttcac tgctgtctct gtattccttg 840
tggagaaggc agggcgacgt tctctctttc taattggaat gagtgggatg tttgtttgtg 900
ccatcttcat gtcagtggga cttgtgctgc tgaataagtt ctcttggatg agttatgtga 960
gcatgatagc catcttcctc tttgtcagct tctttgaaat tgggccaggc ccgatcccct 1020
ggttcatggt ggctgagttt ttcagtcaag gaccacgtcc tgctgcttta gcaatagctg 1080
cattcagcaa ttggacctgc aatttcattg tagctctgtg tttccagtac attgcggact 1140
tctgtggacc ttatgtgttt ttcctctttg ctggagtgct cctggccttt accctgttta 1200
cattttttaa agttccagaa accaaaggaa agtcttttga ggaaattgct gcagaattcc 1260
aaaagaagag tggctcagcc cacaggccaa aagctgctgt agaaatgaaa ttcctaggag 1320
ctacagagac tgtgtaaaaa aaaaaccctg ctttttgaca tgaacagaaa caataaggga 1380
accgtctgtt tttaaatgat gattccttga gcattttata tccacatctt taagtattgt 1440
tttattttta tgtgctctca tcagaaatgt catcaaatat taccaaaaaa gtattttttt 1500
aagttagaga atatattttt gatggtaaga ctgtaattaa gtaaaccaaa aaggctagtt 1560
tattttgtta cactaaaggg caggtggttc taatattttt agctctgttc tttataacaa 1620
ggttcttcta aaattgaaga gatttcaaca tatcattttt ttaacacata actagaaacc 1680
tgaggatgca acaaatattt atatatttga atatcattaa attggaattt tcttacccat 1740
atatcttatg ttaaaggaga tatggctagt ggcaataagt tccatgttaa aatagacaac 1800
tcttccattt attgcactca gcttttttct tgagtactag aatttgtatt ttgcttaaaa 1860
ttttactttt gttctgtatt ttcatgtgga atggattata gagtatacta aaaaatgtct 1920
atagagaaaa 1930

<210> 104

<211> 1205

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7510545CB1

<400> 104

cttggcggag gcgaggaggc cgcggggacg ggaggcgagg ccggccgggc ccccgaagcc 60
atggagaacg cgcacaccaa gacggtggag gaggtgctgg gccacttcgg cgtcaacgag 120
agtgaatctg tctctgtcat caagcacact gatcccgtcc ctgacccacg agctgtcaac 180
caagataaaa agaacatgct gttttctgtg gccctggctg tagcagccat tcctgaaggt 240
ctgcctgcag tcatcaccac ctgcctggct cttggaactc gcagaatggc aaagaaaaat 300
gccattgttc gaagcctccc gtctgtggaa acccttggtt gtacttctgt tatctgctca 360
gacaagactg gtacacttac aacaaaccag atgtcagtct gcaggatgtt cattctggac 420
agagtggaag atcacaccgc tgaacgtgac ccagtggctg atggtgctga aaatctcctt 480
gcccgtgatt ctcatggatg agacgctcaa gtttgtggcc cgcaactacc tggaacctgg 540
taaagagtgt gtgcagcctg ccaccaaatc ctgctcgttc tcggcatgca ccgatgggat 600
ttcctggccg tttgtgctgc tcataatgcc cctggtgatc tgggtctata gcacagacac 660
taactttagc gatatgttct ggtcttgact gacagttttc cataaagaag atgtttaact 720
taatcaatta atttttttat tgtttaaagc aactgtctat ttctgctgaa ttttcacatg 780
aacatactgg ctggtgatgg aggtttcata ctctagattt tgttttgctt tttctgactc 840
cagtggggca agattttcct tttttataca cataattaaa gtgtccattg acatgtacag 900
agaactaaca ctattttatg caaatatttt tttgtagatg aaaaagcatg tacagtgttc 960
tgtttaatac tcatccttgt ataaaaaaaa tagttgagcc agcagacatt gtcagcaaat 1020
taattggcag cagattttag gaaatgaatg tgtgtggttt tttttctaaa actaaatagc 1080
atgtattgtg tcttttgcat gatgatccgg atttaatttg atatcacagt ctaattttta 1140
ttcataagcc attttcgcac ggcagagtct gcactcatca gatgtttggt gcactcgcac 1200
cactg 1205

<210> 105

<211> 1790

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7510654CB1

<400> 105

cggcggttca ggcgccagag ctggccgatc ggcgttggcc gccgacatga cgcccgagga 60 
cccagaggaa acccagccgc ttctggggcc tcctggcggc agcgcgcccc gcggccgccg 120 
cgtcttcctc gccgccttcg ccgctgccct gggcccactc agcttcggct tcgcgctcgg 180 
ctacagctcc ccggccatcc ctagcctgca gcgcgccgcg cccccggccc cgcgcctgga 240 
cgacgccgcc gcctcctggt tcggggctgt cgtgaccctg ggtgccgcgg cggggggagt 300 
gctgggcggc tggctggtgg accgcgccgg gcgcaagctg agcctcttgc tgtgctccgt 360 
gcccttcgtg gccggctttg ccgtcatcac cgcggcccag gacgtgtgga tgctgctggg 420 
gggccgcctc ctcaccggcc tggcctgcgg tgttgcctcc ctagtggccc cggtctacat 480 
ctccgaaatc gcctacccag cagtccgggg gttgctcggc tcctgtgtgc agctaatggt 540 
cgtcgtcggc atcctcctgg cctacctggc aggctgggtg ctggagtggc gctggctggc 600 
tgtgctgggc tgcgtgcccc cctccctcat gctgcttctc atgtgcttca tgcccgagac 660 
cccgcgcttc ctgctgactc agcacaggcg ccaggaggcc atggccgccc tgcggttcct 720 
gtggggctcc gagcagggct gggaagaccc ccccatcggg gctgagcaga gctttcacct 780 
ggccctgctg cggcagcccg gcatctacaa gcccttcatc atcggcgtct ccctgatggc 840 
cttccagcag ctgtcggggg tcaacgccgt catgttctat gcagagacca tctttgaaga 900 
ggccaagttc aaggacagca gcctggcctc ggtcgtcgtg ggtgtcatcc aggtgctgtt 960 
cacagctgtg gcggctctca tcatggacag agcagggcgg aggctgctcc tggtcttgtc 1020 
aggaggtcct caggccctat ggagccttct ggcttgcctc cgctttctgc atcttcagtg 1080 
tccttttcac tttgttctgt gtccctgaaa ctaaaggaaa gactctggaa caaatcacag 1140 
cccattttga ggggcgatga cagccactca ctaggggatg gagcaagcct gtgactccaa 1200 
gctgggccca agcccagagc ccctgcctgc cccaggggag ccagaatcca gccccttgga 1260 
gccttggtct gcagggtccc tccttcctgt catgctccct ccagcccatg acccggggct 1320 
aggaggctca ctgcctcctg ttccagctcc tgctgctgct ctgaggactc aggaacacct 1380 
tcgagctttg cagacctgcg gtcagccctc catgcgcaag actaaagcag cggaagagga 1440 
ggtgggcctc taggatcttt gtcttctggc tggaggtgct tttggaggtt gggtgctggg 1500 
cattcagtcg ctcctctcac gcggctgcct tatcgggaag gaaatttgtt tgccaaataa 1560 
agactgacac agaaaatcag gtcagtgtct ctgggctttg tgcaagctca gtttgaaaag 1620 
ggtttattcc catcactgcc caggacaccc tgtggcttta cttgctcatg gtcagccaag 1680 
cttacccttc acactgagaa gtcatttctg gctacttcct tgggctcagt tccctgggtc 1740 
atcagccatc aaatcttgtt gagtttaaaa aataaacagc aaaaaaaaaa 1790 



<210> 106 

<211> 3824 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510660CB1 
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<400> 106 

gccagcctgt tctgttgccc tggctcttcc tagtccaggc tgccatggcg gcgctcaggg 60
cttaccggaa gtaaaacttc ggaagtgagg cgttcctctg cccggaagtg agcgcggcgc 120
taggaaagat ggcggcagcg gcggcggtgg gcaacgcggt gccctgcggg gcccggcctt 180
gcggggtccg gcctgacggg cagcccaagc ccgggccgca gccgcgcgcg ctccttgccg 240
ccgggccggc gctcatagcg aacggtgacg agctggtggc tgccgtgtgg ccgtaccggc 300
ggttggcgct gttgcggcgc ctcacggtgc tgccattcgc cgggctgctt tacccggcct 360
ggttgggtgc cgcagccgct ggctgctggg gctggggcag cagttgggtg cagatccccg 420
aagctgcgct gctcgtgctt gccaccatct gcctcgcgca cgcgctcact gtcctctcgg 480
ggcattggtc tgtgcacgcg cattgcgcgc tcacctgcac cccggagtac gaccccagca 540
aagcgacctt tgtgaaggtg gtgccaaccc ccaacaatgg ctccacggag ctcgtggccc 600
tgcaccgcaa tgagggcgaa gacgggcttg aggtgctgtc cttcgaattc cagaagatca 660
agtattccta cgatgccctg gagaagaagc agtttctccc cgtggccttt cctgtgggaa 720
acgccttctc atactatcag agcaacagag gcttccagga agactcagag atccgagcag 780
ctgagaagaa atttgggagc aacaaggccg agatggtggt gcctgacttc tcggagcttt 840
tcaaggagag agccacagcc cccttctttg tatttcaggt gttctgtgtg gggctctggt 900
gcctggatga gtactggtac tacagcgtct ttacgctatc catgctggtg gcgttcgagg 960
cctcgctggt gcagcagcag atgcggaaca tgtcggagat ccggaagatg ggcaacaagc 1020
cccacatgat ccaggtctac cgaagccgca agtggaggcc cattgccagt gatgagatcg 1080
taccagggga catcgtctcc atcggccgct ccccacagga gaacctggtg ccatgtgacg 1140
tgcttctgct gcgaggccgc tgcatcgtag acgaggccat gctcacgggg gagtccgtgc 1200
cacagatgaa ggagcccatc gaagacctca gcccagaccg ggtgctggac ctccaggctg 1260
attcccggct gcacgtcatc ttcgggggca ccaaggtggt gcagcacatc cccccacaga 1320
aagccaccac gggcctgaag ccggttgaca gcgggtgcgt ggcctacgtc ctgcggaccg 1380
gattcaacac atcccagggc aagctgctgc gcaccatcct cttcggggtc aagagggtga 1440
ctgcgaacaa cctggagacc ttcatcttca tcctcttcct cctggtgttt gccatcgctg 1500
cagctgccta tgtatggatt gaaggtacca aggaccccag ccggaaccgc tacaagctgt 1560
ttctggagtg caccctgatc ctcacctcgg tcgtgcctcc tgagctgccc atcgagctgt 1620
ccctggccgt caacacctcc ctcatcgccc tggccaagct ctacatgtac tgcacagagc 1680
ccttccggat cccctttgct ggcaaggtcg aggtgtgctg ctttgacaag acggggacgt 1740
tgaccagtga cagcctggtg gtgcgcggtg tggccgggct gagagacggg aaggaggtga 1800
ccccagtgtc cagcatccct gtagaaacac accgggccct ggcctcgtgc cactcgctca 1860
tgcagctgga cgacggcacc ctcgtgggtg accctctaga gaaggccatg ctgacggccg 1920
tggactggac gctgaccaaa gatgagaaag tattcccccg aagtattaaa actcaggggc 1980
tgaaaattca ccagcgcttt cattttgcca gtgccctgaa gcgaatgtcc gtgcttgcct 2040
cgtatgagaa gctgggctcc accgacctct gctacatcgc ggccgtgaag ggggcccccg 2100
aaactctgca ctccatgttc tcccagtgcc cgcccgacta ccaccacatc cacaccgaga 2160
tctcccggga aggagcccgc gtcctggcgc tggggtacaa ggagctggga cacctcactc 2220
accagcaggt ggtcatgatc acgggagaca acccgctcac tgcatgccac gtggcccagg 2280
agctgcactt cattgaaaag gcccacacgc tgatcctgca gcctccctcc gagaaaggcc 2340
ggcagtgcga gtggcgctcc attgacggca gcatcgtgct gcccctggcc cggggctccc 2400
caaaggcact ggccctggag tacgcactgt gcctcacagg cgacggcttg gcccacctgc 2460
aggccaccga cccccagcag ctgctccgcc tcatccccca tgtgcaggtg ttcgcccgtg 2520
tggctcccaa gcagaaggag tttgtcatca ccagcctgaa ggagctgggc tacgtgaccc 2580
tcatgtgtgg ggatggcacc aacgacgtgg gcgccctgaa gcatgctgac gtgggtgtgg 2640
cgctcttggc caatgcccct gagcgggttg tcgagcggcg acggcggccc cgggacagcc 2700
caaccctgag caacagtggc atcagagcca cctccaggac agccaagcag cggtcggggc 2760
tccctccctc cgaggagcag ccaacctccc agagggaccg cctgagccag gtgctgcgag 2820
acctcgagga cgagagtacg cccattgtga aactggggga tgccagcatc gcagcaccct 2880
tcacctccaa gctctcatcc atccagtgca tctgccacgt gatcaagcag ggccgctgca 2940
cgctggtgac cacgctacag atgttcaaga tcctggcgct caatgccctc atcctggcct 3000
acagccagag cgtcctctac ctggagggag tcaagttcag tgacttccag gccaccctac 3060
aggggctgct gctggccggc tgcttcctct tcatctcccg ttccaagccc ctcaagaccc 3120
tctcccgaga acggcccctg cccaacatct tcaacctgta caccatcctc accgtcatgc 3180
tccagttctt tgtgcacttc ctgagccttg tctacctgta ccgtgaggcc caggcccgga 3240
gccccgagaa gcaggagcag ttcgtggact tgtacaagga gtttgagcca agcctggtca 3300
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acagcaccgt ctacatcatg gccatggcca tgcagatggc caccttcgcc atcaattaca 3360 

aaggcccgcc cttcatggag agcctgcccg agaacaagcc cctggtgtgg agtctggcag 3420 

tttcactcct ggccatcatt ggcctgctcc tcggctcctc gcccgacttc aacagccagt 3480 

ttggcctcgt ggacatccct gtggaggtcc tgctcctgga cttctgcctg gcgctcctgg 3540 

ccgaccgcgt cctgcagttc ttcctgggga ccccgaagct gaaagtgcct tcctgagatg 3600

gcagtgctgg tacccactgc ccaccctggc tgccgctggg cgggaacccc aacagggccc 3660

cgggagggaa ccctgccccc aaccccccac agcaaggctg tacagtctcg cccttggaag 3720 

actgagctgg gacccccaca gccatccgct ggcttggcca gcagaaccag ccccaagcca 3780 

gcacctttgg taaataaagc agcatctgag attttaaaaa aaaa 3824 

<210> 107 

<211> 3770 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510661CB1 

<400> 107 

gccagcctgt tctgttgccc tggctcttcc tagtccaggc tgccatggcg gcgctcaggg 60 
cttaccggaa gtaaaacttc ggaagtgagg cgttcctctg cccggaagtg agcgcggcgc 120 
taggaaagat ggcggcagcg gcggcggtgg gcaacgcggt gccctgcggg gcccggcctt 180 
gcggggtccg gcctgacggg cagcccaagc ccgggccgca gccgcgcgcg ctccttgccg 240 
ccgggccggc gctcatagcg aacggtgacg agctggtggc tgccgtgtgg ccgtaccggc 300 
ggttggcgct gttgcggcgc ctcacggtgc tgccattcgc cgggctgctt tacccggcct 360 
ggttgggtgc cgcagccgct ggctgctggg gctggggcag cagttgggtg cagatccccg 420 
aagctgcgct gctcgtgctt gccaccatct gcctcgcgca cgcgctcact gtcctctcgg 480 
ggcattggtc tgtgcacgcg cattgcgcgc tcacctgcac cccggagtac gaccccagca 540 
aagcgacctt tgtgaaggtg gtgccaaccc ccaacaatgg ctccacggag ctcgtggccc 600 
tgcaccgcaa tgagggcgaa gacgggcttg aggtgctgtc cttcgaattc cagaagatca 660 
agtattccta cgatgccctg gagaagaagc agtttctccc cgtggccttt cctgtgggaa 720 
acgccttctc atactatcag agcaacagag gcttccagga agactcagag atccgagcag 780 
ctgagaagaa atttgggagc aacaaggccg agatggtggt gcctgacttc tcggagcttt 840 
tcaaggagag agccacagcc cccttctttg tatttcaggt gttctgtgtg gggctctggt 900 
gcctggatga gtactggtac tacagcgtct ttacgctatc catgctggtg gcgttcgagg 960 
cctcgctggt gcagcagcag atgcggaaca tgtcggagat ccggaagatg ggcaacaagc 1020 
cccacatgat ccaggtctac cgaagccgca agtggaggcc cattgccagt gatgagatcg 1080 
taccagggga catcgtctcc atcggccgct ccccacagga gaacctggtg ccatgtgacg 1140 
tgcttctgct gcgaggccgc tgcatcgtag acgaggccat gctcacgggg gagtccgtgc 1200 
cacagatgaa ggagcccatc gaagacctca gcccagaccg ggtgctggac ctccaggctg 1260 
attcccggct gcacgtcatc ttcgggggca ccaaggtggt gcagcacatc cccccacaga 1320 
aagccaccac gggcctgaag ccggttgaca gcgggtgcgt ggcctacgtc ctgcggaccg 1380 
gattcaacac atcccagggc aagctgctgc gcaccatcct cttcggggtc aagagggtga 1440 
ctgcgaacaa cctggagacc ttcatcttca tcctcttcct cctggtgttt gccatcgctg 1500 
cagctgccta tgtatggatt gaaggtacca aggaccccag ccggaaccgc tacaagctgt 1560 
ttctggagtg caccctgatc ctcacctcgg tcgtgcctcc tgagctgccc atcgagctgt 1620 
ccctggccgt caacacctcc ctcatcgccc tggccaagct ctacatgtac tgcacagagc 1680 
ccttccggat cccctttgct ggcaaggtcg aggtgtgctg ctttgacaag acggggacgt 1740 
tgaccagtga cagcctggtg gtgcgcggtg tggccgggct gagagacggg aaggaggtga 1800 
ccccagtgtc cagcatccct gtagaaacac accgggccct ggcctcgtgc cactcgctca 1860 
tgcagctgga cgacggcacc ctcgtgggtg accctctaga gaaggccatg ctgacggccg 1920 
tggactggac gctgaccaaa gtgccctgaa gcgaatgtcc gtgcttgcct cgtatgagaa 1980 
gctgggctcc accgacctct gctacatcgc ggccgtgaag ggggcccccg aaactctgca 2040 
ctccatgttc tcccagtgcc cgcccgacta ccaccacatc cacaccgaga tctcccggga 2100 
aggagcccgc gtcctggcgc tggggtacaa ggagctggga cacctcactc accagcaggt 2160 
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ggtcatgatc acgggagaca acccgctcac tgcatgccac gtggcccagg agctgcactt 2220 
cattgaaaag gcccacacgc tgatcctgca gcctccctcc gagaaaggcc ggcagtgcga 2280 
gtggcgctcc attgacggca gcatcgtgct gcccctggcc cggggctccc caaaggcact 2340 
ggccctggag tacgcactgt gcctcacagg cgacggcttg gcccacctgc aggccaccga 2400 
cccccagcag ctgctccgcc tcatccccca tgtgcaggtg ttcgcccgtg tggctcccaa 2460 
gcagaaggag tttgtcatca ccagcctgaa ggagctgggc tacgtgaccc tcatgtgtgg 2520 
ggatggcacc aacgacgtgg gcgccctgaa gcatgctgac gtgggtgtgg cgctcttggc 2580 
caatgcccct gagcgggttg tcgagcggcg acggcggccc cgggacagcc caaccctgag 2640 
caacagtggc atcagagcca cctccaggac agccaagcag cggtcggggc tccctccctc 2700 
cgaggagcag ccaacctccc agagggaccg cctgagccag gtgctgcgag acctcgagga 2760 
cgagagtacg cccattgtga aactggggga tgccagcatc gcagcaccct tcacctccaa 2820 
gctctcatcc atccagtgca tctgccacgt gatcaagcag ggccgctgca cgctggtgac 2880 
cacgctacag atgttcaaga tcctggcgct caatgccctc atcctggcct acagccagag 2940 
cgtcctctac ctggagggag tcaagttcag tgacttccag gccaccctac aggggctgct 3000 
gctggccggc tgcttcctct tcatctcccg ttccaagccc ctcaagaccc tctcccgaga 3060 
acggcccctg cccaacatct tcaacctgta caccatcctc accgtcatgc tccagttctt 3120 
tgtgcacttc ctgagccttg tctacctgta ccgtgaggcc caggcccgga gccccgagaa 3180 
gcaggagcag ttcgtggact tgtacaagga gtttgagcca agcctggtca acagcaccgt 3240 
ctacatcatg gccatggcca tgcagatggc caccttcgcc atcaattaca aaggcccgcc 3300 
cttcatggag agcctgcccg agaacaagcc cctggtgtgg agtctggcag tttcactcct 3360 
ggccatcatt ggcctgctcc tcggctcctc gcccgacttc aacagccagt ttggcctcgt 3420 
ggacatccct gtggagttca agctggtcat tgcccaggtc ctgctcctgg acttctgcct 3480 
ggcgctcctg gccgaccgcg tcctgcagtt cttcctgggg accccgaagc tgaaagtgcc 3540 
ttcctgagat ggcagtgctg gtacccactg cccaccctgg ctgccgctgg gcgggaaccc 3600 
caacagggcc ccgggaggga accctgcccc caacccccca cagcaaggct gtacagtctc 3660 
gcccttggaa gactgagctg ggacccccac agccatccgc tggcttggcc agcagaacca 3720 
gccccaagcc agcacctttg gtaaataaag cagcatctga gattttaaaa 3770 

<210> 108 
<211> 1978 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7510680CB1 

<400> 108 

cctcctacct gcggctctgg gccctcagcc tggctcatgc acaactgtct gaagtgctct 60 
ggactatggt gatgaacagc ggccttcaga cgcgaggctg gggaggaatc gtcggggttt 120 
ttattatttt tgccgtattt gctgtcctga cagtagccat ccttctgatc atggagggcc 180 
tctctgcttt cctgcacgcc ctgcgactgc actgggaagc tgtttgggga acttgagcta 240 
tttagaagat ggcaaccaag ccaacagagc ctgtcacgat cctcagcctt cggaaattga 300 
gcctggggac cgcagagcca caggttaaag agccaaagac gttcaccgtg gaagatgcag 360 
tggagactat cggcttcggg cgtttccaca ttgccctctt tctgatcatg ggcagtactg 420 
gggtggttga ggccatggag atcatgttga tagctgttgt gtctcctgtc atccgctgtg 480 
aatggcaact ggagaattgg caggtggcat tagtaaccac gatggtgttt tttggctaca 540 
tggttttcag tatcctcttt ggcctcctgg ctgacagata tggccgctgg aagattctgc 600 
tcatctcgtt cctgtgggga gcctatttct ccttgctgac ctcgtttgct ccttcgtaca 660 
tctggtttgt cttcctgcgg acgatggtgg gctgtggtgt gtccggccac tcgcaagggt 720 
taatcataaa gactgaattt ttgcccacga aataccgagg ctatatgtta cccttgtctc 780 
aggtgttctg gcttgcgggc tccctgctca tcattggctt ggcctctgtg atcatcccca 840 
ccatcgggtg gcgctggctc attcgcgtcg cctccatccc gggcatcatc ctcatcgtgg 900 
ccttcaagtt tattcctgaa tctgcccggt tcaatgtctc cactgggaac actcgggctg 960 
ccctggccac tctggagcgc gttgccaaga tgaaccgctc ggtcatgccg gaggggaagc 1020 
tggtggagcc cgtcctggaa aaaagaggaa gatttgcaga cctattggat gctaaatatt 1080
tacggaccac attacagatc tgggtcatat ggcttggaat ctcttttgcc tactatgggg 1140
ttatcctggc cagtgctgag ctgctggagc gggacttggt ctgtggttca aagtcagact 1200
ctgcggtggt ggtgactggg ggggactcag gggagagcca gagcccctgc tactgccaca 1260
tgtttgcacc ctctgactat cggaccatga tcatcagcac catcggtgaa attgctttga 1320
atcctttaaa tatactgggc atcaatttcc tgggaagacg gctgagcctt tctattacca 1380
tgggatgcac ggctttattc ttccttctcc tcaacatttg cacttcaagt gccggcctga 1440
ttggcttcct cttcatgctg agggctctgg tagctgcaaa cttcaacacc gtctacattt 1500
acacagctga ggttcttatg agtgcatcaa tactgggggc cctgtgtctc ttctcatctg 1560
tctgtgttgt atgcgccatt tctgcattca ctctccccat cgaaaccaaa ggacgggccc 1620
tccagcaaat taaatgaaga cctgcaaagc tatgtctacc agatgagaaa aatgaattct 1680
atcttcagaa ctgcggtgca tttttttaaa acttggtttt acttctgtat gctactcggt 1740
aattagtaaa gtgatttttt ttttaaaggc atatatggga atggggtagg taactgtata 1800
ttgatctctt ccttgaggaa caatatataa agtactttta taaaatataa tttaagcttt 1860
caaaggggtg tgagagggag atggtggggg ggaagatggc ttttcttcat tgaaatcaag 1920
tctgtaaacc tttatatgaa taaatactaa attttaaact tacaaaaaaa aaaaaagg 1978

<210> 109
<211> 1622
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 7505145CB1

<400> 109

tgcctggtgc cctgttccgc gtctgtgcga ccgtccgtcc cgagcgcgca ccgcggcagg 60

ccaggtcaga gcagcccacc atgggatggg gagggggtgg aggctgcacc ccgcgcccac 120

ccatccacca gcagccgccg gagcgccgcg tggtcaccgt tgtctttctc ggcctcctgc 180

tggacctcct ggccttcacg ctgctgctgc ccctgctgcc cgggctgttg gagagccacg 240

gccgtgccca cgaccccctc tatggctcct ggcagggcgg ggtggactgg tttgccaccg 300

ccatcgggat gccagtggag aagaggtaca acagtgtcct gttcggaggt ctcattggct 360

cggcattctc tgtcctgcag tttctgtgtg cgccactcac tggggccacc tctgactgct 420

tggggaggcg cccggtgatg ctgctgtgcc tgatgggtgt ggccacctca tatgcagtct 480

gggccacctc tcggagcttt gcggccttcc tggcctccag gctgattggg ggcatcagca 540

aagggaacgt cagcctctcc acggccatcg ttgctgacct gggctcgcct ctggcccgca 600

gtcaaggcat ggcggtcatt ggggtggcct tctcactggg cttcaccctg ggccctatgc 660

tcggagcctc cctgcccctg gaaatggcac cctggtttgc cctgctcttc gcagcctccg 720

acctgctgtt catcttctgc ttcctgccag agacgctgcc cctggagaaa cgggcgccct 780

ctatcgccct ggggttccgt gatgcggctg atctgctcag ccccctggcc ctgctgcgct 840

tctcggctgt cgctcgtggc caggacccac cctctggaga caggctcagc agcctgcgcc 900

gcctgggcct agtctacttc ctctacctct tcctgttctc gggcctggag tacacgctga 960

gcttcctcac acaccagcgc ttccagttca gtaggccctc ctgctgctgg tgcccgcctt 1020

cctcctcatc ggctggggac gttctctgcc cgtgctgggc ctggggctgc tgctctactc 1080

ctttgccgcc gccgttgtgg tgccctgcct gtcctccgtg gtcgctggct atggctcacc 1140

agggcagaag ggcacggtca tgggtacact gcgcagccta ggtgctctgg ccagggccgc 1200

ggggcccctg gtggccgctt cagtgtactg gctggccggg gcccaggcct gcttcaccac 1260

gtggtccggg ctctttttgc tccccttctt cctcctgcag aagctgagtt acccggcaca 1320

gacgctcaag gctgagtagc tgagccactg tgcccaggct gtgggcacca ggcagagtgg 1380

gagcctaggt caggcccctg cccactgcct gacccccacc ccccgccagt ccagggagac 1440

cctgtgggtg ggggccggcc cctaagcagg aagctcaggc agctcctcca gacttactta 1500

ctccttcagt gactccgagc tgcagcactc caaggctgtc agggcttctg tttgtttttt 1560

aaactatgca ccaggtttct gatgatgaaa taaagcacct gtttgtttta taaaaaaaaa 1620
aa 1622

<210> 110
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<211> 1982 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7505162CB1 

<400> 110 

ggctcccgcc tctgttcagg acactgggtc cccttggagc ctccccaggc ttaatgattg 60 
tccagaaggc ggctataaag ggagcctggg aggctgggtg gaggagggag cagaaaaaac 120 
ccaactcagc agatctggga actgtgagag cggcaagcag gaactgtggt cagaggctgt 180
gcgtcttggc tggtagggcc tgctcttttc taccatggca gcccagggct atggctatta 240
tcgcactgtg atcttctcag ccatgtttgg gggctacagc ctgtattact tcaatcgcaa 300
gaccttctcc tttgtcatgc catcattggt ggaagagatc cctttggaca aggatgattt 360 
ggggttcatc accagcagcc agtcggcagc ttatgctatc agcaagtttg tcagtggggt 420
gctgtctgac cagatgagtg ctcgctggct cttctcttct gggctgctcc tggttggcct 480 
ggtcaacata ttctttgcct ggagctccac agtacctgtc tttgctgccc tctggttcct 540
taatggcctg gcccaggggc tgggctggcc cccatgtggg aaggtcctgc ggaagtggtt 600 
tgagccatct cagtttggca cttggtgggc catcctgtca accagcatga acctggctgg 660
agggctgggc cctatcctgg caaccatcct tgcccagagc tacagctggc gcagcacgct 720 
ggccctatct ggggcactgt gtgtggttgt ctccttcctc tgtctcctgc tcatccacaa 780 
tgaacctgct gatgttggac tccgcaacct ggaccccatg ccctctgagg gcaagaaggg 840
ctccttgaag gaggagagca ccctgcagga gctgctgctg tccccttacc tgtgggtgct 900
ctccactggt taccttgtgg tgtttggagt aaagacctgc tgtactgact ggggccagtt 960 
cttccttatc caggagaaag gacagtcagc ccttgtaggc gggactgtcc aactacggga 1020
accctcgcca tggcctgttg ctgttcatga tggctggcat gacagtgtcc atgtacctct 1080 
tccgggtaac agtgaccagt gactccccca agctctggat cctggtattg ggagctgtat 1140
ttggtttctc ctcgtatggc cccattgccc tgtttggagt catagccaac gagagtgccc 1200
ctcccaactt gtgtggcacc tcccacgcca ttgtgggact catggccaat gtgggcggct 1260 
ttctggctgg gctgcccttc agcaccattg ccaagcacta cagttggagc acagccttct 1320 
gggtggctga agtgatttgt gcggccagca cggctgcctt cttcctccta cgaaacatcc 1380 
gcaccaagat gggccgagtg tccaagaagg ctgagtgaag agagtccagg ttccggagca 1440
ccatcccacg gtggccttcc ccctgcacgc tctgcgggga gaaaaggagg ggcctgcctg 1500
gctagccctg aacctttcac tttccatttc tgcgcctttt ctgtcacccg ggtggcgctg 1560 
gaagttatca gtggctagtg aggtcccagc tccctgatcc tatgctctat ttaaaagata 1620
acctttggcc ttagactccg ttagctccta tttcctgcct tcagacaaac aggaaacttc 1680 
tgcagtcagg aaggctcctg tacccttctt cttttcctag gccctgtcct gcccgcatcc 1740 
taccccatcc ccacctgaag tgaggctatc cctgcagctg cagggcacta atgacccttg 1800 
acttctgctg ggtcctaagt cctctcagca gtgggtgact gctgttgcca atacctcaga 1860
ctccagggaa agagaggagg ccatcattct cactgtacca ctaggcgcag ttggatatag 1920
gtgggaagaa aaggtgactt gttatagaag attaaaacta gatttgatac tgaaaaaaaa 1980 
aa 1982 

<210> 111 

<211> 2231 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505469CB1 

<400> 111 

ggcttatgaa agtgtgatgt tcgcgtattt cttgccaggc agtggcgtga tcttggctca 60
ctgcaacctc cgactccctg gttcaagcga ttctcctgcc tcagcctcct gagtggggat 120
tacaggccac agcaaacaca ggtgtgcagg aaccgtttgt catggaagcc agggagcctg 180
ggaggcccac acccacctac catcttgtcc ctaacaccag ccagtcccag gtggaagaag 240
atgtcagctc gccacctcaa aggtcctccg aaactatgca gctgaagaag gagatctccc 300
tgctgaatgg ggtcagcctg gtggtgggca acatgatcgg ctcagggatc tttgtctcac 360
ccaagggtgt gctggtacac actgcctcct atgggatgtc actgattgtg tgggccattg 420
gtgggctctt ctctgttgtg ggtgcccttt gttatgcaga gctggggacc accatcacca 480
agtcgggagc cagctacgct tatattctag aggcctttgg gggcttcatt gccttcatcc 540
gcctgtgggt ctcactgcta gttgttgagc ccaccggtca ggccatcatc gccatcacct 600
ttgccaacta catcatccag ccgtccttcc ccagctgtga tcccccatac ctggcctgcc 660
gtctcctggc tgctgcttgc atatgtctgc tgacatttgt gaactgtgcc tatgtcaagt 720
ggggcacacg tgtgcaggac acgttcactt acgccaaggt cgtagcgctc attgccatca 780
ttgtcatggg ccttgttaaa ctgtgccagg aaatttgccc ttggccattg ggatttctat 840
gccaattgtg acgctcatct acatcctgac caatgtggcc tattacacag tgctgaacat 900
ttcagatgtc cttagcagtg atgctgtggc tgtgacattt gctgaccaga cgtttggcat 960
gttcagctgg accatcccca ttgctgttgc cctgtcctgc tttgggggcc tcaatgcatc 1020
catctttgct tcatcaaggt tgttcttcgt gggctcccgg gagggccacc taccggacct 1080
tctgtccatg atccacattg agcgttttac acctatccct gctttactgt tcaattgcac 1140
catggcactc atctacctca tcgtggagga tgttttccag cttatcaact acttcagctt 1200
cagctactgg ttcttcgtgg gcctgtctgt tgttggacag ctctacctcc gctggaagga 1260
gcccaagcgg ccccggcctc tcaagctgag cgtgtttttc cccatcgtgt tctgcatatg 1320
ctccgtgttt ctggtgatag tgcccctctt cactgacacc attaattccc tcattggcat 1380
cgggattgcc ctttctggag tccctttcta cttcatgggt gtttacctgc cagagtcccg 1440
gaggccattg tttattcgga atgtcctggc tgctatcacc agaggcaccc agcagctttg 1500
cttttgtgtc ctgactgagc ttgatgtagc cgaagaaaaa aaggatgaga ggaaaactga 1560
ctagaggtca gaggtggctt tctgaggcct ggaaggcagg ccaaccagca aaatcctgat 1620
aacaagactc tgtgggccca actctcctga attaaaggag ccttttgacc caatcatata 1680
gtggggctca gggccagtgc tcactcttat tggctttgtt tacacttgga gccaccttgg 1740
tgtgggtcac cgggacagtg tactcctctc ctgccagcct ccccttcccc gaggtgtggt 1800
ggctgcagtc tcaggaagag cttggtactt gtggggactt ctgttttctc cctgtggaga 1860
tcagtgaaga ctgggaggaa agctgcttca acctgagtcc ggctcttcag caggctgcac 1920
aagtggaagc aactaattct ggtgctcagg ctgggctctc cacccaagtt aggcctgctc 1980
tggcctaatg gatcttactg tatgagcagg acggctgcat tggattgtac aactgttttg 2040
tgatgccccc agacactgtc atcctaggcc gagaagaacc tgctagcttg acatacccca 2100
tgggcttatc cttaggtttt ggaattggtc aacagtgagg cagtctccct tcctgaccat 2160
tcttctccac ccagtcacag ataagggaat aaccttggcc atatatttgc tcaataaaga 2220
ttgaaggaag c 2231



<210> 112 

<211> 5170 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7505475CB1 



<400> 112 

agctgctgga gtaggcaccc atttaaagaa aaaatgaaga agcagcaata aagaagttgt 60
aatcgttacc tagacaaaca gagaactggt tttgacagtg tttctagagt gctttttatt 120
attttcctga cagttgtgtt ccaccatgat tactttctcc ttcagcgaat aggctaaatg 180
aatatgaaac agaaaagcgt gtatcagcaa accaaagcac ttctgtgcaa gaattttctt 240
aagaaatgga ggatgaaaag agagagctta ttggaatggg gcctctcaat acttctagga 300
ctgtgtattg ctctgttttc cagttccatg agaaatgtcc agtttcctgg aatggctcct 360
cagaatctgg gaagggtaga taaatttaat agctcttctt taatggttgt gtatacacca 420
atatctaatt taacccagca gataatgaat aaaacagcac ttgctcctct tttgaaagga 480
acaagtgtca ttggggcacc aaataaaaca cacatggacg aaatacttct ggaaaattta 540
ccatatgcta tgggaatcat ctttaatgaa actttctctt ataagttaat atttttccag 600
ggatataaca gtccactttg gaaagaagat ttctcagctc attgctggga tggatatggt 660
gagttttcat gtacattgac caaatactgg aatagaggat ttgtggcttt acaaacagct 720
attaatactg ccattataga aatcacaacc aatcaccctg tgatggagga gttgatgtca 780
gttactgcta taactatgaa gacattacct ttcataacta aaaatcttct tcacaatgag 840
atgtttattt tattcttctt gcttcatttc tccccacttg tatattttat atcactcaat 900
gtaacaaaag agagaaaaaa gtctaagaat ttgatgaaaa tgatgggtct ccaagattca 960
gcattctggc tctcctgggg tctaatctat gctggcttca tctttattat ttccatattc 1020
attacaatta tcataacatt cacccaaatt atagtcatga ctggcttcat ggtcatattt 1080
ataccctttt ttttatatgg cttatctttg gtagctttgg tgttcctgct gagtgtgctg 1140
ttaaagaaag ctgtcctcac caatttggtt gtgtttctcc ttaccctctt ttggggatgt 1200
ctgggattca ctgtatttta tgaacaactt ccttcatctc tggagtggat tttgaatatt 1260
tgtagccctt ttgcctttac tactggaatg attcagatta tcaaactgga ttataacttg 1320
aatggtgtaa tttttcctga cccttcagga gactcatata caatgatagc aactttttct 1380
atgttgcttt tggatggtct catctacttg ctattggcat tatactttga caaaatttta 1440
ccctatggag atgagcgcca ttattctcct ttatttttct tgaattcatc atcttgtttc 1500
caacaccaaa ggactaatgc taaggttatt gagaaagaaa tcgatgctga gcatccctct 1560
gatgattatt ttgaaccagt agctcctgaa ttccaaggaa aagaagccat cagaatcaga 1620
aatgttaaga aggaatataa aggaaaatct ggaaaagtgg aagcattgaa aggcttgctc 1680
tttgacatat atgaaggtca aatcacggca atcctgggtc acagtggagc tggcaaatct 1740
tcactgctaa atattcttaa tggattgtct gttccaacag aaggatcagt taccatctat 1800
aataaaaatc tctctgaaat gcaagacttg gaggaaatca gaaagataac tggcgtctgt 1860
cctcaattca atgttcaatt tgacatactc accgtgaagg aaaacctcag cctgtttgct 1920
aaaataaaag ggattcatct aaaggaagtg gaacaagagg tacaacgaat attattggaa 1980
ttggacatgc aaaacattca agataacctt gctaaacatt taagtgaagg acagaaaaga 2040
aagctgactt ttgggattac cattttagga gatcctcaaa tagaaaagtg atcatgtcca 2100
atgggagact gaagtgtgca ggttcttcta tgtttttgaa aagaaggtgg ggtcttggat 2160
atcacctaag tttacatagg aatgaaatat gtaacccaga acaaataaca tccttcatta 2220
ctcatcacat ccccgatgct aaattaaaaa cagaaaacaa agaaaagctt gtatatactt 2280
tgccactgga aaggacaaat acatttccag atcttttcag tgatctggat aagtgttctg 2340
accagggagt gacaggttat gacatttcca tgtcaactct aaatgaagtc tttatgaaac 2400
tggaaggaca gtcaactatc gaacaagatt tcgaacaagt ggagatgata agagactcag 2460
aaagcctcaa tgaaatggag ctggctcact cttccttctc tgaaatgcag acagctgtga 2520
gtgacatggg cctctggaga atgcaagtct ttgccatggc acggctccgt ttcttaaagt 2580
taaaacgtca aactaaagtg ttattgaccc tattattggt atttggaatc gcaatattcc 2640
ctttgattgt tgaaaatata atatatgcta tgttaaatga aaagatcgat tgggaattta 2700
aaaacgaatt gtattttctc tctcctggac aacttcccca ggaaccccgt accagcctgt 2760
tgatcatcaa taacacagaa tcaaatattg aagattttat aaaatcactg aagcatcaaa 2820
atatactttt ggaagtagat gactttgaaa acagaaatgg tactgatggc ctctcataca 2880
atggagctat catagtttct ggtaaacaaa aggattatag attttcagtt gtgtgtaata 2940
ccaagagatt gcactgtttt ccaattctta tgaatattat cagcaatggg ctacttcaaa 3000
tgtttaatca cacacaacat attcgaattg agtcaagccc atttcctctt agccacatag 3060
gactctggac tgggttgccg gatggttcct ttttcttatt tttggttcta tgtagcattt 3120
ctccttatat caccatgggc agcatcagtg attacaagaa aaatgctaag tcccagctat 3180
ggatttcagg cctctacact tctgcttact ggtgtgggca ggcactagtg gacgtcagct 3240
tcttcatttt aattctcctt ttaatgtatt taattttcta catagaaaac atgcagtacc 3300
ttcttattac aagccaaatt gtgtttgctt tggttatagt tactcctggt tatgcagctt 3360
ctcttgtctt cttcatatat atgatatcat ttatttttcg caaaaggaga aaaaacagtg 3420
gcctttggtc attttacttc ttttttgcct ccaccatcat gttttccatc actttaatca 3480
atcattttga cctaagtata ttgattacca ccatggtatt ggttccttca tataccttgc 3540
ttggatttaa aacttttttg gaagtgagag accaggagca ctacagagaa tttccagagg 3600
caaattttga attgagtgcc actgattttc tagtctgctt cataccctac tttcagactt 3660
tgctattcgt ttttgttcta agatgcatgg aactaaaatg tggaaagaaa agaatgcgaa 3720
aagatcctgt tttcagaatt tccccccaaa gtagagatgc taagccaaat ccagaagaac 3780
ccatagatga agatgaagat attcaaacag aaagaataag aacagccact gctctgacca 3840
cttcaatctt agatgagaaa cctgttataa ttgccagctg tctacacaaa gaatatgcag 3900
gccagaagaa aagttgcttt tcaaagagga agaagaaaat agcagcaaga aatatctctt 3960
tctgtgttca agaaggtgaa attttgggat tgctaggacc cagtggtgct ggaaaaagtt 4020
catctattag aatgatatct gggatcacaa agccaactgc tggagaggtg gaactgaaag 4080
gctgcagttc agttttgggc cacctggggt actgccctca agagaacgtg ctgtggccca 4140
tgctgacgtt gagggaacac ctggaggtgt atgctgccgt caaggggctc aggaaagcgg 4200
acgcgaggct cgccatcgca agattagtga gtgctttcaa actgcatgag cagctgaatg 4260
ttcctgtgca gaaattaaca gcaggaatca cgagaaagtt gtgttttgtg ctgagcctcc 4320
tgggaaactc acctgtcttg ctcctggatg aaccatctac gggcatagac cccacagggc 4380
agcagcaaat gtggcaggca atccaggcag tcgttaaaaa cacagagaga ggtgtcctcc 4440
tgaccaccca taacctggct gaggcggaag ccttgtgtga ccgtgtggcc atcatggtgt 4500
ctggaaggct tagatgcatt ggctccatcc aacacctgaa aaacaaactt ggcaaggatt 4560
acattctaga gctaaaagtg aaggaaacgt ctcaagtgac tttggtccac actgagattc 4620
tgaagctttt cccacaggct gcagggcagg aaaggtattc ctctttgtta acctataagc 4680
tgcccgtggc agacgtttac cctctatcac agacctttca caaattagaa gcagtgaagc 4740
ataactttaa cctggaagaa tacagccttt ctcagtgcac actggagaag gtattcttag 4800
agctttctaa agaacaggaa gtaggaaatt ttgatgaaga aattgataca acaatgagat 4860
ggaaactcct ccctcattca gatgaacctt aaaacctcaa acctagtaat tttttgttga 4920
tctcctataa acttatgttt tatgtaataa ttaatagtat gtttaatttt aaagatcatt 4980
taaaattaac atcaggtata ttttgtaaat ttagttaaca aatacataaa ttttaaaatt 5040
attcttcctc tcaaacatag gggtgatagc aaacctgtga taaaggcaat acaaaatatt 5100
agtaaagtca cccaaagatt caggcactgg aattatggac taaaacggta tatctagttt 5160
tagcctcttc 5170



<210> 113 

<211> 1876 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7505568CB1 



<400> 113 

gccttggcag agtctggggt ccctggactg agccatcagc tgggtcactg agacccatgg 60
caaggaaaca aaataggaat tccaaggaac tgggcctagt tcccctcaca gatgacacca 120
gccacgccgg gcctccaggg ccagggaggg cactgctgga gtgtgaccac ctgaggagtg 180
gggtgccagg tggaaggaga agaaaggact ggtcctgctc gctcctcgtg gcctccctcg 240
cgggcgcctt cggctcctcc ttcctctacg gctacaacct gtcggtggtg aatgccccca 300
ccccggaagc acactttgct ggccaataat gggtttgcaa tttctgctgc attgctgatg 360
gcctgctcgc tccaggcagg agcctttgaa atgctcatcg tgggacgctt catcatgggc 420
atagatggag gcgtcgccct cagtgtgctc cccatgtacc ttagtgagat ctcacccaag 480
gagatccgtg gctctctggg gcaggtgact gccatcttta tctgcattgg cgtgttcact 540
gggcagcttc tgggcctgcc cgagctgctg ggaaaggaga gtacctggcc atacctgttt 600
ggagtgattg tggtccctgc cgttgtccag ctgctgagcc ttccctttct cccggacagc 660
ccacgctacc tgctcttgga gaagcacaac gaggcaagag ctgtgaaagc cttccaaacg 720
ttcttgggta aagcagacat ttcccaagag gtagaggagg tcctggctga gagccgcgtg 780
cagaggagca tccgcctggt gtccgtgctg gagctgctga gagctcccta cgtccgctgg 840
caggtggtca ccgtgattgt caccatggcc tgctaccagc tctgtggcct caatgcaatt 900
tggttctata ccaacagcat ctttggaaaa gctgggatcc ctccggcaaa gatcccatac 960
gtcaccttga gtacaggggg catcgagact ttggctgccg tcttctctgg tttggtcatt 1020
gagcacctgg gacggagacc cctcctcatt ggtggctttg ggctcatggg cctcttcttt 1080
gggaccctca ccatcacgct gaccctgcag gaccacgccc ccctgggtcc cctacctgag 1140
tatcgtgggc attctggcca tcatcgcctc tttctgcagt gggcccaggt ggcatcccgt 1200
tcatcttgac tggtgagttc ttccagcaat ctcagcggcc ggctgccttc atcattgcag 1260
gcaccgtcaa ctggctctcc aactttgctg ttgggctcct cttcccattc attcagaaaa 1320
gtctggacac ctactgtttc ctagtctttg ctacaatttg tatcacaggt gctatctacc 1380
tgtattttgt gctgcctgag accaaaaaca gaacctatgc agaaatcagc caggcatttt 1440
ccaaaaggaa caaagcatac ccaccagaag agaaaatcga ctcagctgtc actgatgcag 1500
ctcaaaggaa Qtaagacaaa gatcatggag accatcgggt gagtctcaag acttccccca 1560
gctctgcttg gctggtctcc tgctggtatt ttctgtctgt agagaggaac aagaacttcc 1620
attttatctt gcttacctgc acttatgaaa agtcaaactg agtcatgctg agagccaggg 1680
aacataggag tcagttcttc tgcagcagca ctcagccagt tgaaggcaat gtggagtgat 1740
ggaaggagag cagtgatgca gtgatgctgg caccaactcc tttactatgg catccattgt 1800
accagctgcc atacaccagg caacttctac actttatctc taatcatcct agaataagta 1860
ttagtttccc catctt 1876



<210> 114 
<211> 1602 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506953CB1 



<400> 114 

ctctctctct ctctctctct ctctctctct ctctctctct ctcccaagtt tcctatctcg 60
tcaagatcag ggcaaaagaa gaaaacaccg aattctgctt gccgtttcag agcggcggtg 120
atgaagacaa aattgaacat ctacaacatg cagttcctgc tttttgtttt cttggtgtgg 180
gaccctgcca ggttggtgct ggctaacatc caagaagatg aggctaaaaa taacattacc 240
atctttacga gaattcttga cagacttctg gatggttacg ataatcggct tagaccagga 300
ctgggagatg catatacaac ttcagaggtc acttatattt ggacttacaa tgcatctgat 360
tcagtacagg ttgctcctga tggctctagg ttaaatcaat atgacctgct gggccaatca 420
atcggaaagg agacaattaa atccagtaca ggtgaatata ctgtaatgac agctcatttc 480
cacctgaaaa gaaaaattgg gtattttgtg attcaaacct atctgccttg catcatgact 540
gtcattctct cccaagtttc attctggctt aacagagaat ctgtgcctgc aagaactgtg 600
tttggagtaa caactgtcct aacaatgaca actctaagca tcagtgctcg gaattctctc 660
cccaaagtgg cttatgcaac tgccatggac tggtttattg ctgtttgtta tgcatttgtg 720
ttctctgccc taattgaatt tgcaactgtt aattacttca ccaaaagagg atgggcttgg 780
gatgggaaga gtgtagtaaa tgacaagaaa aaagaaaagg cttccgttat gatacagaac 840
aacgcttatg cagtggctgt tgccaattat gccccgaatc tttcaaaaga tccagttctc 900
tccaccatct ccaagagtgc aaccacgcca gaacccaaca agaagccaga aaacaagcca 960
gctgaagcaa agaaaacttt caacagtgtt agcaaaattg acagaatgtc cagaatagtt 1020
tttccagttt tgtttggtac ctttaattta gtttactggg ctacatattt aaacagagaa 1080
cctgtattag gggtcagtcc ttgaattgag acccatgtta tctttgggat gtatagcaac 1140
attaaatttg gtttgttttg ctatgtacag tctgactaat aactgctaat ttgtgatcca 1200
acatgtacag tatgtatata gtgacatagc ttaccagtag acctttaatg gagacatgca 1260
tttgctaact catggaactg cagacagaaa gcactccatg cgaaaacagc cattgccttt 1320
tttaaagatt taccctagga cctgatttaa agtgaatttc aaatgacctg attaatttcc 1380
tattcttcca aatgagatga aaatggggat cctgtacaac cctttgtgga cacttttggt 1440
ttagctctta agtaggggta ttttctactg ttgcctaata tgatggaagt aacattgtca 1500
ttctagatga atctttgaag taccaacatt gttctggaat cagctcggtc cagatgctca 1560
aggtccctgt aatgtattgg aagctggtac cctaagaaaa ca 1602



<210> 115 
<211> 2173 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510176CB1 
<400> 115

cactattgaa gccacctgct caggacaatg aaattcttca gttacattct ggtttatcgc 60 
cgatttctct tcgtggtttt cactgtgttg gttttactac ctctgcccat cgtcctccac 120 
accaagctta ttctgacttt tccaagatga agatgaagag caggccagtg gcatgggcca 180 
gatgcagtca gatgactgac tgaccaggag ggcctccgtc cagtgaccag tctcagaatt 240 
taaaggaccc ttccccctta cttatgcttg tactcagccc aggaagcaga atgtgcctac 300 
acactctttg tggtcgccac attttggctc acagaagcat tgcctctgtc ggtaacagct 360 
ttgctaccta gtttaatgtt acccatgttt gggatcatgc cttctaagaa ggtggcatct 420 
gcttatttca aggattttca cttactgcta attggagtta tctgtttagc aacatccata 480 
gaaaaatgga atttgcacaa gagaattgct ctgaaaatgg tgatgatggt tggtgtaaat 540 
cctgcatggc tgacgctggg gttcatgagc agcactgcct ttttgtctat gtggctcagc 600 
aacacctcga cggctgccat ggtgatgccc attgcggagg ctgtagtgca gcagatcatc 660 
aatgcagaag cagaggtcga ggccactcag atgacttact tcaacggatc aaccaaccac 720 
ggactagaaa ttgatgaaag tgttaatgga catgaaataa atgagaggaa agagaaaaca 780 
aaaccagttc caggatacaa taatgataca gggaaaattt caagcaaggt ggagttggaa 840 
aagaactcag gcatgagaac caaatatcga acaaagaagg gccacgtgac acgtaaactt 900 
acgtgtttgt gcattgccta ctcttctacc attggtggac tgacaacaat cactggtacc 960 
tccaccaact tgatctttgc agagtatttc aatacacgct atcctgactg tcgttgcctc 1020 
aactttggat catggtttac gttttccttc ccagctgccc ttatcattct actcttatcc 1080 
tggatctggc ttcagtggct tttcctagga ttcaatttta aggagatgtt caaatgtggc 1140 
aaaaccaaaa cagtccaaca aaaagcttgt gctgaggtga ttaagcaaga ataccaaaag 1200 
cttgggccaa taaggtatca agaaattgtg accttggtcc tcttcattat aatggctctg 1260 
ctatggttta gtcgagaccc cggatttgtt cctggttggt ctgcactttt ttcagagtac 1320 
cctggttttg ctacagattc aactgttgct ttacttatag ggctgctatt ctttcttatc 1380 
ccagctaaga cactgactaa aactacacct acaggagaaa ttgttgcttt tgattactct 1440 
ccactgatta cttggaaaga attccagtca ttcatgccct gggatatagc cattcttgtt 1500 
ggtggagggt ttgccctggc agatggttgt gaggagtctg gattatctaa gtggatagga 1560 
aataaattat ctcctctggg ttcattacca gcatggctaa taattctgat atcttctttg 1620 
atggtgacat ctttaactga ggtagccagc aatccagcta ccattacact ctttctccca 1680
atattatctc cattggccga agccattcat gtgaaccctc tttatattct gataccttct 1740 
actctgtgta cttcatttgc attcctccta ccagtagcaa atccacccaa tgctattgtc 1800 
ttttcatatg gtcatctgaa agtcattgac atggttaaag ctggacttgg tgtcaacatt 1860
gttggtgttg ctgtggttat gcttggcata tgtacttgga ttgtacccat gtttgacctc 1920 
tacacttacc cttcgtgggc tcctgctatg agtaatgaga ccatgccata ataagcacaa 1980 
aatttctgac tatcttgcgg taatttctgg aagacattaa tgattgactg taaaatgtgg 2040 
ctctaaataa ctaatgacac acatttaaat cagttatggt gtagctgctg caattcccgt 2100 
gaatacccga aacctgctgt tataactcag agtccatatt tgttattgca gtgcaactaa 2160 
agagcatcta tgt 2173

<210> 116 

<211> 1826 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510541CB1 

<400> 116 

cccacgcgtc cgcccacgcg tccgcccacg cgtccgcagc cgagttcagc cgggagcaga 60
gcgaaccgca ccggcccgag cggagcgccg cacgttccca accgcgaggc cagacatctg 120
actgttggtg tgagaccagt gctcctggtg gtgtgccctg agccatggag gcgcctttgc 180
agacagagat ggtggagctg gtgcccaatg gcaaacactc agaggggctg ctcccggtca 240
tcacccccat ggcaggcaac cagagggtcg aggaccctgc acggagctgt atggagggca 300
agagcttcct acagaaaagt cccagcaagg agccacactt cactgacttc gaggggaaga 360
catcattcgg gatgtcagtg ttcaacctca gcaatgccat catgggcagc ggcatcctgg 420
gactcgccta tgccatggcc aatacgggca ttatcctttt cctgcatccg tgcctatgag 480
cagctgggct accgtgcctt tgggacccca ggaaagctgg cagcagcctt ggccatcacg 540 
ctccagaaca tcggagccat gtccagctac ctgtacatca tcaagtctga gctgccactt 600 
gtcatacaga ccttcctgaa cctggaggag aaaacctcgg actggtacat gaacgggaac 660 
tacctggtaa tccttgtctc tgtcaccatc attctgcccc tggcactgat gcggcagctt 720 
ggctacctgg gctactccag cggcttctct cttagctgca tggtgttctt cctaattgca 780 
gtcatctaca aaaagttcca cgtgccctgc ccactgcccc ccaacttcaa caacaccaca 840 
ggcaacttca gccacgtgga gatcgtgaag gagaaggtgc agctgcaggt cgagcctgag 900 
gcttcagcct tctgcactcc cagctacttc acgctcaact cacagacagc atacaccatc 960
cccatcatgg ccttcgcctt cgtctgccac cccgaggtgc tgcccatcta tactgagctc 1020
aaggacccct ccaagaagaa gatgcagcac atctccaacc tgtccatcgc tgtcatgtac 1080
atcatgtact tcctggctgc cctcttcggc tacctcacct tctacaacgg ggtggagtcg 1140
gagctgctgc acacctacag caaggtggac ccgtttgacg tcctgatcct gtgtgtgcgc 1200
gtggccgtgc tgacagcagt cacgctcaca gtgcccatcg ttctgttccc ggtgcgccgc 1260
gccatccagc agatgctgtt tccaaaccag gagttcagct ggctgcggca tgtgcttatt 1320
gccgttggcc tgctcacttg tatcaacctg ctggtcatct ttgcccccaa catcctgggc 1380
atctttgggg tcatcggtgc cacatctgcc ccattcctca tcttcatctt ccctgccatc 1440
ttctacttcc gaatcatgcc cacggagaag gagcctgcaa gatccacccc caaaatcctg 1500
gccctgtgtt ttgctatgct tggcttcttg ctgatgacca tgagcttgag cttcatcatc 1560
attgactggg cctcagggac cagccggcat ggaggaaacc actagggtga ccctcatcct 1620
gttctgtcta ctcaccctag cagccctgcc cagactcttc agcccctgct cccatccagt 1680
ggccagtcgg gggaggagaa agacgcgatt aacactgtgg cattcagcca ggccccatgt 1740
cctctctgtg gaaggttttt gttcaagagc caggaccaag gcccttgggc cactaccctg 1800 
ctaggctctg gagctgtaga ggcttc 1826 

<210> 117 

<211> 2052 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510923CB1 

<400> 117 

cccacgcgtc cgcccacgcg tccgcccacg cgtccgcagc cgagttcagc cgggagcaga 60
gcgaaccgca ccggcccgag cggagcgccg cacgttgcca accgcgaggc cagacatctg 120
actgttggtg tgagaccagt gctcctggtg gtgtgccctg agccatggag gcgcctttgc 180
agacagagat ggtggagctg gtgcccaatg gcaaacactc agaggggctg ctcccggtca 240
tcacccccat ggcaggcaac cagagggtcg aggaccctgc acggagctgt atggagggca 300
agagcttcct acagaaaagt cccagcaagg agccacactt cactgacttc gaggggaaga 360
catcattcgg gatgtcagtg ttcaacctca gcaatgccat catgggcagc ggcatcctgg 420
gactcgccta tgccatggcc aatacgggca ttatcctttt cctgttcctg ttgacagctg 480
tcgccttgct ctccagctac tccatccacc tgctactcaa gtcctcaggg gtcgtgggca 540
tccgtgccta tgagcagctg ggctaccgtg cctttgggac cccaggaaag ctggcagcag 600
ccctggccat cacgctccag aacatcggag ccatgtccag ctacctgtac atcatcaagt 660
ctgagctgcc acttgtcata cagaccttcc tgaacctgga ggagaaaacc tcggactggt 720
acatgaacgg gaactacctg gtaatccttg tctctgtcac catcattctg cccctggcac 780
tgatgcggca gcttggctac ctgggctact ccagcggctt ctctcttagc tgcatggtgt 840
tcttcctaat tgcagtcatc tacaaaaagt tccacgtgcc ctgcccactg ccccccaact 900
tcaacaacac cacaggcaac ttcagccacg tggagatcgt gaaggagaag gtgcagctgc 960
aggtcgagcc tgaggcttca gccttctgca ctcccagcta cttcacgctc aactcacagg 1020
ttctgacagg tcagggcaag gcgggggccc aatgagagtg gcagactgcc ttgacacagc 1080
cccgtgtttc cccagacagc atacaccatc cccatcatgg ccttcgcctt cgtctgccac 1140
cccgaggtgc tgcccatcta tactgagctc aaggacccct ccaagaagaa gatgcagcac 1200
atctccaacc tgtccatcgc tgtcatgtac atcatgtact tcctggctgc cctcttcggc 1260 
tacctcacct tctacaacgg ggtggagtcg gagctgctgc acacctacag caaggtggac 1320
ccgtttgacg tcctgatcct gtgtgtgcgc gtggccgtgc tgacagcagt cacgctcaca 1380
gtgcccatcg ttctgttccc ggtgcgccgc gccatccagc agatgctgtt tccaaaccag 1440
gagttcagct ggctgcggca tgtgcttatt gccgttggcc tgctcacttg tatcaacctg 1500
ctggtcatct ttgcccccaa catcctgggc atctttgggg tcatcggtgc cacatctgcc 1560
ccattcctca tcttcatctt ccctgccatc ttctacttcc gaatcatgcc cacggagaag 1620
gagcctgcaa gatccacccc caaaatcctg gccctgtgtt ttgctatgct tggcttcttg 1680
ctgatgacca tgagcttgag cttcatcatc attgactggg cctcagggac cagccggcat 1740
ggaggaaacc actagggtga ccctcatcct gttctgtcta ctcaccctag cagccctgcc 1800
cagactcttc agcccctgct cccatccagt ggccagtcgg gggaggagaa agacgcgatt 1860
aacactgtgg cattcagcca ggccccatgt cctctctgtg gaaggttttt gttcaagagc 1920
caggaccaag gcccttgggc cactaccctg ctaggctctg gagctgtaga ggcttcctga 1980
aactgggagc agggtagggc tgtcgccttt agatcccgcc caagccccct cattccctcc 2040
tttgcacaga tg 2052



<210> 118 

<211> 5056 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510984CB1 

<400> 118 

cgcgcggagc cagcggagcc agctgagccc gagcccagcc cgcgcccgcg ccgccatgcc 60
cctggccttc tgcggcagcg agaaccactc ggccgcctac cgggtggacc agggggtcct 120
caacaacggc tgctttgtgg acgcgctcaa cgtggtgccg cacgtcttcc tactcttcat 180
caccttcccc atcctcttca ttggatgggg aagtcagagc tccaaggtgc acatccacca 240
cagcacatgg cttcatttcc ctgggcacaa cctgcggtgg atcctgacct tcatgctgct 300
cttcgtcctg gtgtgtgaga ttgcagaggg catcctgtct gatggggtga ccgaatccca 360
ccatctgcac ctgtacatgc cagccgggat ggcgttcatg gctgctgtcg cctccgtggt 420
ctactatcac aacatcgaga cttccaactt ccccaagctg ctaattgccc tgctggtgta 480
ttggaccctg gccttcatca ccaagaccat caagtttgtc aagttcttgg accacgccat 540
cggcttctcg cagctacgct tctgcctcac agggctgctg gtgatcctct atgggatgct 600
gctcctcgtg gaggtcaatg tcatcagggt gaggagatac atcttcttca agacaccgag 660
ggaggtgaag cctcccgagg acctgcaaga cctgggggta cgcttcctgc agcccttcgt 720
gaatctgctg tccaaaggca cctactggtg gatgaacgcc ttcatcaaga ctgcccacaa 780
gaagcccatc gacttgcgag ccatcgggaa gctgcccatc gccatgaggg ccctcaccaa 840
ctaccaacgg ctctgcgagg cctttgacgc ccaggtgcgg aaggacattc agggcactca 900
aggtgcccgg gccatctggc aggcactcag ccatgccttc gggaggcgcc tggtcctcag 960
cagcactttc cgcatcttgg ccgacctgct gggcttcgcc gggccactgt gcatctttgg 1020
gatcgtggac caccttggga aggagaacga cgtcttccag cccaagacac aatttctcgg 1080
ggtttacttt gtctcatccc aagagttcct tgccaatgcc tacgtcttag ctgtgcttct 1140
gttccttgcc ctcctactgc aaaggacatt tctgcaagca tcctactatg tggccattga 1200
aactggaatt aacttgagag gagcaataca gaccaagatt tacaataaaa ttatgcacct 1260
gtccacctcc aacctgtcca tgggagaaat gactgctgga cagatctgta atctggttgc 1320
catcgacacc aatcagctca tgtggttttt cttcttgtgc ccaaacctct gggctatgcc 1380
agtacagatc attgtgggtg tgattctcct ctactacata ctcggagtca gtgccttaat 1440
tggagcagct gtcatcattc tactggctcc tgtccagtac ttcgtggcca ccaagctgtc 1500
tcaggcccag cggagcacac tggagtattc caatgagcgg ctgaagcaga ccaacgagat 1560
gctccgcggc atcaagctgc tgaagctgta cgcctgggag aacatcttcc gcacgcgggt 1620
ggagacgacc cgcaggaagg agatgaccag cctcagggcc tttgccatct atacctccat 1680
ctccattttc atgaacacgg ccatccccat tgcagctgtc ctcataactt tcgtgggcca 1740
cgtcagcttc ttcaaagagg ccgacttctc gccctccgtg gcctttgcct ccctctccct 1800
cttccatatc ttggtcacac cgctgttcct gctgtccagt gtggtccgat ctaccgtcaa 1860
agctctagtg agcgtgcaaa agctaagcga gttcctgtcc agtgcagaga tccgtgagga 1920
gcagtgtgcc ccccatgagc ccacacctca gggcccagcc agcaagtacc aggcggtgcc 1980 
cctcagggtt gtgaaccgca agcgtccagc ccgggaggat tgtcggggcc tcaccggccc 2040 
actgcagagc ctggtcccca gtgcagatgg cgatgctgac aactgctgtg tccagatcat 2100 
gggaggctac ttcacgtgga ccccagatgg aatccccaca ctgtccaaca tcaccattcg 2160 
tatcccccga ggccagctga ctatgatcgt ggggcaggtg ggctgcggca agtcctcgct 2220 
ccttctagcc gcactggggg agatgcagaa ggtctcaggg gctgtcttct ggagcagcag 2280 
ccttcctgac agcgagatag gagaggaccc cagcccagag cgggagacag cgaccgactt 2340 
ggatatcagg aagagaggcc ccgtggccta tgcttcgcag aaaccatggc tgctaaatgc 2400 
cactgtggag gagaacatca tctttgagag tcccttcaac aaacaacggt acaagatggt 2460 
cattgaagcc tgctctctgc agccagacat cgacatcctg ccccatggag accagaccca 2520 
gattggggaa cggggcatca acctgtctgg tggtcaacgc cagcgaatca gtgtggcccg 2580 
agccctctac cagcacgcca acgttgtctt cttggatgac cccttctcag ctctggatat 2640 
ccatctgagt gaccacttaa tgcaggccgg catccttgag ctgctccggg acgacaagag 2700 
gacagtggtc ttagtgaccc acaagctaca gtacctgccc catgcagact ggatcattgc 2760 
catgaaggat ggcaccatcc agagggaggg taccctcaag gacttccaga ggtctgaatg 2820 
ccagctcttt gagcactgga agaccctcat gaaccgacag gaccaagagc tggagaagga 2880 
gactgtcaca gagagaaaag ccacagagcc accccagggc ctatctcgtg ccatgtcctc 2940 
gagggatggc cttctgcagg atgaggaaga ggaggaagag gaggcagctg agagcgagga 3000 
ggatgacaac ctgtcgtcca tgctgcacca gcgtgctgag atcccatggc gagcctgcgc 3060 
caagtacctg tcctccgccg gcatcctgct cctgtcgttg ctggtcttct cacagctgct 3120 
caagcacatg gtcctggtgg ccatcgacta ctggctggcc aagtggaccg acagcgccct 3180 
gaccctgacc cctgcagcca ggaactgctc cctcagccag gagtgcaccc tcgaccagac 3240 
tgtctatgcc atggtgttca cggtgctctg cagcctgggc attgtgctgt gcctcgtcac 3300 
gtctgtcact gtggagtgga cagggctgaa ggtggccaag agactgcacc gcagcctgct 3360 
aaaccggatc atcctagccc ccatgaggtt ttttgagacc acgccccttg ggagcatcct 3420
gaacagattt tcatctgact gtaacaccat cgaccagcac atcccatcca cgctggagtg 3480 
cctgagccgc tccaccctgc tctgtgtctc agccctggcc gtcatctcct atgtcacacc 3540 
tgtgttcctc gtggccctct tgcccctggc catcgtgtgc tacttcatcc agaagtactt 3600 
ccgggtggcg tccagggacc tgcagcagct ggatgacacc acccagcttc cacttctctc 3660
acactttgcc gaaaccgtag aaggactcac caccatccgg gccttcaggt atgaggcccg 3720 
gttccagcag aagcttctcg aatacacaga ctccaacaac attgcttccc tcttcctcac 3780 
agctgccaac agatggctgg aagtccgaat ggagtacatc ggtgcatgtg tggtgctcat 3840
cgcagcggtg acctccatct ccaactccct gcacagggag ctctctgctg gcctggtggg 3900 
cctgggcctt acctacgccc taatggtctc caactacctc aactggatgg tgaggaacct 3960 
ggcagacatg gagctccagc tgggggctgt gaagcgcatc catgggctcc tgaaaaccga 4020 
ggcagagagc tacgaggggc tcctgggtga gaggctcagg gagaggggtg gggaagagtc 4080 
caaggaggag tgtgtctggg ttgggggcca caagggagcc tggggatggg gtggcacttt 4140 
cggatactag ctgtggccca tgcctggtgg ctgagcccag cccggccccc agcaccatcg 4200 
ctgatcccaa agaactggcc agaccaaggg aagatccaga tccagaacct gagcgtgcgc 4260 
tacgacagct ccctgaagcc ggtgctgaag cacgtcatgc cctcatcgcc ctggacagaa 4320 
gatcgggatc tgcggccgca ccggcagtgg gaagtcctcc ttctctcttg ccttcttccg 4380 
catggtggac acgttcgaag ggcacatcat cattgatggc attgacatcg ccaaactgcc 4440 
gctgcacacc ctgcgctcac gcctctccat catcctgcag gaccccgtcc tcttcagcgg 4500 
caccatccga tttaacctgg accctgagag gaagtgctca gatagcacac tgtgggaggc 4560 
cctggaaatc gcccagctga agctggtggt gaaggcactg ccaggaggcc tcgatgccat 4620 
catcacagaa ggcggggaga atttcagcca gggacagagg cagctgttct gcctggcccg 4680 
ggccttcgtg aggaagacca gcatcttcat catggacgag gccacggctt ccattgacat 4740 
ggccacggaa aacatcctcc aaaaggtggt gatgacagcc ttcgcagacc gcactgtggt 4800 
caccatcgcg catcgagtgc acaccatcct gagtgcagac ctggtgatcg tcctgaagcg 4860 
gggtgccatc cttgagttcg ataagccaga gaagctgctc agccggaagg acagcgtctt 4920 
cgcctccttc gtccgtgcag acaagtgacc tgccagagcc caagtgccat cccacattcg 4980 
gaccctgccc atacccctgc ctgggttttc taactgtaaa tcacttgtaa ataaatagat 5040 
ttgattattt cctaaa 5056